
ACTIVATION OF HCV-SPECIFIC T CELLS

TECHNICAL AREA OF THE INVENTION 

The invention relates to the activation of hepatitis C virus (HCV)-specific T
cells. More particularly, the invention relates to the use of multiple HCV
polypeptides, either alone or as fusions, to stimulate cell-mediated immune responses,
such as to activate HCV-specific T cells.

BACKGROUND OF THE INVENTION

Hepatitis C virus (HCV) infection is an important health problem, with
approximately 1% of the world's population infected with the virus. Over 75% of
acutely infected individuals eventually progress to a chronic carrier state that can
result in cirrhosis, liver failure, and hepatocellular carcinoma. See Alter et al. (1992)
N. Engl. J. Med. 327:1899-1905; Resnick and Koff (1993) Arch. Intern. Med.
153:1672-1677; Seeff (1995) Gastrointest. Dis. 6:20-27; Tong et al. (1995) N. Engl. J.
Med. 332:1463-1466.

Despite extensive advances in the development of pharmaceuticals against
certain viruses like HIV, control of acute and chronic HCV infection has had limited
success (Hoofnagle and di Bisceglie (1997) N. Engl. J. Med. 336:347-356). In
particular, generation of a strong cytotoxic T lymphocyte (CTL) response is thought
to be important for the control and eradication of HCV infections. Thus, there is a
need in the art for effective methods of inducing strong CTL responses against HCV.

SUMMARY OF THE INVENTION

It is an object of the invention to provide reagents and methods for stimulating
immune responses, such as activating T cells which recognize epitopes of HCV
polypeptides. This and other objects of the invention are provided by one or more of
the embodiments described below.

The invention provides HCV proteins useful for stimulating immune
responses, such as activating HCV-specific T cells. One embodiment provides a
fusion protein that comprises HCV polypeptides, wherein the HCV polypeptides
consist essentially of an NS3, an NS4, an NS5a polypeptide, and optionally a core
polypeptide. In certain embodiments, the fusion protein includes an NS5b
polypeptide.

In certain embodiments, at least one of the HCV polypeptides is derived from
a different strain of HCV than the other polypeptides.

The invention also provides compositions comprising any of these fusion
proteins and a pharmaceutically acceptable excipient. In certain embodiments, the
compositions further comprise an adjuvant or a CpG polynucleotide, and/or the fusion
protein is adsorbed to or entrapped within a microparticle or ISCOM. The
compositions can further comprise a polynucleotide encoding an E1E2 complex. The
E1E2 polynucleotide can also be adsorbed to or entrapped within a microparticle.

Another embodiment provides a composition comprising HCV polypeptides
and a pharmaceutically acceptable excipient. The HCV polypeptides consist
essentially of an NS3, an NS4, an NS5a polypeptide, and optionally a core
polypeptide. In certain embodiments, the composition includes an NS5b polypeptide.
In other embodiments, the compositions further comprise an adjuvant or a CpG
polynucleotide, and/or one or more of the HCV polypeptides is adsorbed to or
entrapped within a microparticle or ISCOM. The compositions can further comprise a
polynucleotide encoding an E1E2 complex. The E1E2 polynucleotide can also be
adsorbed to or entrapped within a microparticle. Moreover, one of the HCV
polypeptides may be derived from a different strain of HCV than the others.

Still another embodiment of the invention provides an isolated and purified
polynucleotide which encodes a fusion protein as described above. In additional
embodiments, the fusion proteins further include a polynucleotide encoding an E1E2
complex.

Yet another embodiment of the invention provides a composition comprising
the polynucleotides described above and a pharmaceutically acceptable excipient. In
certain embodiments, the compositions further comprise an adjuvant and/or the
polynucleotide may be adsorbed to or entrapped within a microparticle. The
compositions can further comprise a polynucleotide encoding an E1E2 complex. The
E1E2 polynucleotide can also be adsorbed to or entrapped within a microparticle.

In a further embodiment, the invention provides a composition comprising
HCV polynucleotides and a pharmaceutically acceptable excipient, wherein the HCV
polynucleotides consist essentially of polynucleotides encoding an NS3, an NS4, an
NS5a polypeptide, and optionally a core polypeptide. In certain embodiments, the
composition also includes a polynucleotide encoding an NS5b polypeptide. The
compositions may further comprise an adjuvant and/or one or more of the
polynucleotides may be adsorbed to or entrapped within a microparticle. The
compositions can further comprise a polynucleotide encoding an E1E2 complex. The
E1E2 polynucleotide can also be adsorbed to or entrapped within a microparticle.
Additionally, one or more of the polynucleotides may be derived from a different
strain of HCV than the others.

In another embodiment, the invention provides a method of activating T cells
which recognize an epitope of an HCV polypeptide. T cells are contacted with any of
the fusions, polynucleotides or compositions described above. A population of
activated T cells recognizes an epitope of the NS3, NS4, NS5a, NS5b, core and/or
E1E2 polypeptide.

In the proteins and polynucleotides above, the regions in the fusions need not
be in the order in which they naturally occur in the native HCV polyprotein. Thus, for
example, the NS5b polypeptide, if present, may be at the N- and/or C-terminus of the
fusion, or may be located internally. Similarly, the E1 polypeptide may precede or
follow the E2 polypeptide. The E1E2 polypeptide may also be part of the
nonstructural fusion protein or may be provided separately, as an E1E2 complex, or as
individual polypeptides.

Moreover, the NS3 polypeptide may include a modification to inhibit protease
activity, such that cleavage of the fusion is inhibited. Such modifications are
described more fully below. Additionally, the compositions can comprise more than
one HCV nonstructural fusion protein, such as a fusion protein with NS3, NS4 and
NS5a, and a fusion protein with NS3, NS4, NS5a, NS5b and E1E2. The E1E2
complexes, whether present separately or as part of the fusion, can have varying E1E2
polypeptides (described more fully below).

In certain embodiments, the nonstructural fusion protein consists of, from the
amino terminus to the carboxyl terminus, an NS3, an NS4, an NS5a and, optionally,
an NS5b polypeptide, and the E1E2 complex consists of, from the amino terminus to the
carboxyl terminus, an E1 polypeptide and an E2 polypeptide.

The various polypeptides (and the polynucleotides encoding them) are derived
from the same HCV isolate, or from different strains and isolates, including isolates
having any of the various HCV genotypes, to provide increased protection against a
broad range of HCV genotypes.

Yet another embodiment of the invention provides a method of stimulating an
immune response, such as a cellular immune response, in a vertebrate subject by
administering a composition as described herein. In certain embodiments, the
composition activates T cells which recognize an epitope of an HCV polypeptide. T
cells are contacted with a composition as described above. A population of activated
T cells recognizes an epitope of one or more of the HCV polypeptide(s).

The invention thus provides methods and reagents for stimulating immune
responses to HCV, such as for activating T cells which recognize epitopes of HCV
polypeptides. These methods and reagents are particularly advantageous for
identifying epitopes of HCV polypeptides associated with a strong CTL response and
for immunizing mammals, including humans, against HCV.

BRIEF DESCRIPTION OF THE FIGURES

Figure 1 is a diagrammatic representation of the HCV genome, depicting the 
various regions of the HCV polyprotein. 

Figure 2 (SEQ ID NOS:9 and 10) depicts the DNA and corresponding amino
acid sequence of a representative native NS3 protease domain.

Figures 3A-3C (SEQ ID NOS:3 and 4) show the nucleotide and
corresponding amino acid sequence for the HCV-1 E1/E2/p7 region. The numbers
shown in the figure are relative to the full-length HCV-1 polyprotein. The E1, E2 and
p7 regions are shown.

Figure 4 is a diagram of plasmid pMHE1E2-809, encoding E1E2 809, a
representative E1E2 protein for use with the present invention.

Figures 5A-5J (SEQ ID NOS:7 and 8) depict the DNA and corresponding
amino acid sequence of a representative NS345Core fusion protein. The depicted
sequence includes amino acids 1242-3011 of the HCV polyprotein (representing
polypeptides from NS3, NS4, NS5a and NS5b) with amino acids 1-121 of the HCV
polyprotein (representing a polypeptide from the core region) fused to the C-terminus
of NS5b. This numbering is relative to the HCV-1 polyprotein.

Figure 6 shows a side-by-side comparison of IFN-γ expression generated in
animals in response to delivery of alphavirus constructs encoding NS3NS4NS5a.

Figure 7 shows IFN-γ expression generated in animals in response to delivery
of plasmid DNA encoding NS3NS4NS5a ("naked"), PLG-linked DNA encoding
NS3NS4NS5a ("PLG"), separate DNA plasmids encoding NS5a, NS34a, and NS4ab
("naked"), and PLG-linked DNA encoding NS5a, NS34a, and NS4ab ("PLG").

Figure 8 shows HCV-specific CD8+ and CD4+ responses in vaccinated
chimpanzees.

Figure 9 depicts the specificity of T cell responses primed by electroporation
of plasmid DNA two weeks subsequent to the third immunization.

Figure 10 shows the specificity of T cell responses primed by vaccinating
chimpanzees with NS345Core, 21 -ISCOMS two weeks subsequent to the third
immunization.

DETAILED DESCRIPTION OF THE INVENTION

The practice of the present invention will employ, unless otherwise indicated,
conventional methods of chemistry, biochemistry, recombinant DNA techniques and
immunology, within the skill of the art. Such techniques are explained fully in the
literature. See, e.g., Sambrook, et al., Molecular Cloning: A Laboratory Manual (2nd
Edition); Methods In Enzymology (S. Colowick and N. Kaplan eds., Academic Press,
Inc.); DNA Cloning, Vols. I and II (D.N. Glover ed.); Oligonucleotide Synthesis (M.J.
Gait ed.); Nucleic Acid Hybridization (B.D. Hames & S.J. Higgins eds.); Animal Cell
Culture (R.K. Freshney ed.); Perbal, B., A Practical Guide to Molecular Cloning.

It must be noted that, as used in this specification and the appended claims, the
singular forms "a", "an" and "the" include plural referents unless the content clearly
dictates otherwise. Thus, for example, reference to "an antigen" includes a mixture of
two or more antigens, and the like.

The following amino acid abbreviations are used throughout the text:

Alanine: Ala (A)          Arginine: Arg (R)
Asparagine: Asn (N)       Aspartic acid: Asp (D)
Cysteine: Cys (C)         Glutamine: Gln (Q)
Glutamic acid: Glu (E)    Glycine: Gly (G)
Histidine: His (H)        Isoleucine: Ile (I)
Leucine: Leu (L)          Lysine: Lys (K)
Methionine: Met (M)       Phenylalanine: Phe (F)
Proline: Pro (P)          Serine: Ser (S)
Threonine: Thr (T)        Tryptophan: Trp (W)
Tyrosine: Tyr (Y)         Valine: Val (V)
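
By way of illustration only, the abbreviations listed above can be held in a lookup table for converting between the three-letter and one-letter codes. The following Python sketch is not part of the specification; the names THREE_TO_ONE and to_one_letter are hypothetical.

```python
# Three-letter to one-letter amino acid codes, as listed in the table above.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def to_one_letter(residues):
    """Convert an iterable of three-letter codes to a one-letter string."""
    # capitalize() normalizes inputs such as "ALA" or "ala" to "Ala".
    return "".join(THREE_TO_ONE[r.capitalize()] for r in residues)
```

For example, to_one_letter(["Met", "Ser", "Thr"]) yields the string "MST".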


I. Definitions

In describing the present invention, the following terms will be employed, and 
are intended to be defined as indicated below. 

The terms "polypeptide" and "protein" refer to a polymer of amino acid
residues and are not limited to a minimum length of the product. Thus, peptides,
oligopeptides, dimers, multimers, and the like, are included within the definition.
Both full-length proteins and fragments thereof are encompassed by the definition.
The terms also include postexpression modifications of the polypeptide, for example,
glycosylation, acetylation, phosphorylation and the like. Furthermore, for purposes of
the present invention, a "polypeptide" refers to a protein which includes
modifications, such as deletions, additions and substitutions (generally conservative in
nature), to the native sequence, so long as the protein maintains the desired activity.
These modifications may be deliberate, as through site-directed mutagenesis, or may
be accidental, such as through mutations of hosts which produce the proteins or errors
due to PCR amplification.

An HCV polypeptide is a polypeptide, as defined above, derived from the
HCV polyprotein. The polypeptide need not be physically derived from HCV, but
may be synthetically or recombinantly produced. Moreover, the polypeptide may be
derived from any of the various HCV strains and isolates including isolates having any
of the 6 genotypes of HCV described in Simmonds et al., J. Gen. Virol. (1993)
74:2391-2399 (e.g., strains 1, 2, 3, 4 etc.), as well as newly identified isolates, and
subtypes of these isolates, such as HCV1a, HCV1b, etc. A number of conserved and
variable regions are known between these strains and, in general, the amino acid
sequences of epitopes derived from these regions will have a high degree of sequence
homology, e.g., amino acid sequence homology of more than 30%, preferably more
than 40%, when the two sequences are aligned. Thus, for example, the term "NS4"
polypeptide refers to native NS4 from any of the various HCV strains, as well as NS4
analogs, muteins and immunogenic fragments, as defined further below.

By an "E1 polypeptide" is meant a molecule derived from an HCV E1 region.
The mature E1 region of HCV-1 begins at approximately amino acid 192 of the
polyprotein and continues to approximately amino acid 383, numbered relative to the
full-length HCV-1 polyprotein. (See, Figures 1 and 3A-3C. Amino acids 192-383 of
Figures 3A-3C correspond to amino acid positions 20-211 of SEQ ID NO:4.) Amino
acids at around 173 through approximately 191 (amino acids 1-19 of SEQ ID NO:4)
serve as a signal sequence for E1. Thus, by an "E1 polypeptide" is meant either a
precursor E1 protein, including the signal sequence, or a mature E1 polypeptide which
lacks this sequence, or even an E1 polypeptide with a heterologous signal sequence.
The E1 polypeptide includes a C-terminal membrane anchor sequence which occurs at
approximately amino acid positions 360-383 (see, International Publication No. WO
96/04301, published February 15, 1996). An E1 polypeptide, as defined herein, may
or may not include the C-terminal anchor sequence or portions thereof.
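
The two numbering schemes used above differ by a constant offset: polyprotein residues 173-191 correspond to residues 1-19 of SEQ ID NO:4, and residues 192-383 correspond to residues 20-211, so the offset is 172. The following Python sketch merely illustrates that arithmetic; the function names and the constant SEQ4_OFFSET are hypothetical, and the region boundaries themselves are stated in the text only as approximate.

```python
# Offset implied by the text: polyprotein residue 173 is residue 1 of SEQ ID NO:4.
SEQ4_OFFSET = 172


def polyprotein_to_seq4(pos):
    """Map an HCV-1 polyprotein position to its position in SEQ ID NO:4."""
    if pos <= SEQ4_OFFSET:
        raise ValueError("position lies before the start of SEQ ID NO:4")
    return pos - SEQ4_OFFSET


def seq4_to_polyprotein(pos):
    """Map a SEQ ID NO:4 position back to HCV-1 polyprotein numbering."""
    if pos < 1:
        raise ValueError("SEQ ID NO:4 positions start at 1")
    return pos + SEQ4_OFFSET
```

With this offset, the mature E1 boundaries 192 and 383 map to 20 and 211, matching the correspondence given above.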

By an "E2 polypeptide" is meant a molecule derived from an HCV E2 region.
The mature E2 region of HCV-1 begins at approximately amino acid 383-385,
numbered relative to the full-length HCV-1 polyprotein. (See, Figures 1 and 3A-3C.
Amino acids 383-385 of Figures 3A-3C correspond to amino acid positions 211-213
of SEQ ID NO:4.) A signal peptide begins at approximately amino acid 364 of the
polyprotein. Thus, by an "E2 polypeptide" is meant either a precursor E2 protein,
including the signal sequence, or a mature E2 polypeptide which lacks this sequence,
or even an E2 polypeptide with a heterologous signal sequence. The E2 polypeptide
includes a C-terminal membrane anchor sequence which occurs at approximately
amino acid positions 715-730 and may extend as far as approximately amino acid
residue 746 (see, Lin et al., J. Virol. (1994) 68:5063-5073). An E2 polypeptide, as
defined herein, may or may not include the C-terminal anchor sequence or portions
thereof. Moreover, an E2 polypeptide may also include all or a portion of the p7
region which occurs immediately adjacent to the C-terminus of E2. As shown in
Figures 1 and 3A-3C, the p7 region is found at positions 747-809, numbered relative
to the full-length HCV-1 polyprotein (amino acid positions 575-637 of SEQ ID
NO:4). Additionally, it is known that multiple species of HCV E2 exist (Spaete et al.,
Virol. (1992) 188:819-830; Selby et al., J. Virol. (1996) 70:5177-5182; Grakoui et al.,
J. Virol. (1993) 67:1385-1395; Tomei et al., J. Virol. (1993) 67:4017-4026).
Accordingly, for purposes of the present invention, the term "E2" encompasses any of
these species of E2 including, without limitation, species that have deletions of 1-20
or more of the amino acids from the N-terminus of the E2, such as, e.g., deletions of 1,
2, 3, 4, 5 ... 10 ... 15, 16, 17, 18, 19 ... etc. amino acids. Such E2 species include those
beginning at amino acid 387, amino acid 402, amino acid 403, etc.

Representative E1 and E2 regions from HCV-1 are shown in Figures 3A-3C
and SEQ ID NO:4. For purposes of the present invention, the E1 and E2 regions are
defined with respect to the amino acid number of the polyprotein encoded by the
genome of HCV-1, with the initiator methionine being designated position 1. See,
e.g., Choo et al., Proc. Natl. Acad. Sci. USA (1991) 88:2451-2455. However, it
should be noted that the term an "E1 polypeptide" or an "E2 polypeptide" as used
herein is not limited to the HCV-1 sequence. In this regard, the corresponding E1 or
E2 regions in other HCV isolates can be readily determined by aligning sequences
from the isolates in a manner that brings the sequences into maximum alignment.
This can be performed with any of a number of computer software packages, such as
ALIGN 1.0, available from the University of Virginia, Department of Biochemistry
(Attn: Dr. William R. Pearson). See, Pearson et al., Proc. Natl. Acad. Sci. USA (1988)
85:2444-2448.
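
The idea of locating a corresponding region by alignment can be sketched as follows. This is a minimal illustration only, assuming a simple Needleman-Wunsch global alignment with arbitrary match, mismatch and gap scores; it is not the ALIGN 1.0 program, and the function names and toy sequences are hypothetical. Once two sequences are aligned, a position numbered relative to the reference can be translated into the numbering of the other isolate.

```python
def global_align(ref, qry, match=1, mismatch=-1, gap=-2):
    """Needleman-Wunsch global alignment; returns the two gapped strings."""
    n, m = len(ref), len(qry)
    # score[i][j] is the best score for aligning ref[:i] with qry[:j].
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if ref[i - 1] == qry[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)
    # Trace back from the bottom-right corner to recover one optimal alignment.
    a, b = [], []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            sub = match if ref[i - 1] == qry[j - 1] else mismatch
        if i > 0 and j > 0 and score[i][j] == score[i - 1][j - 1] + sub:
            a.append(ref[i - 1]); b.append(qry[j - 1]); i -= 1; j -= 1
        elif i > 0 and score[i][j] == score[i - 1][j] + gap:
            a.append(ref[i - 1]); b.append("-"); i -= 1
        else:
            a.append("-"); b.append(qry[j - 1]); j -= 1
    return "".join(reversed(a)), "".join(reversed(b))


def map_position(ref_aln, qry_aln, ref_pos):
    """Return the 1-based query position aligned to 1-based ref_pos, or None if gapped."""
    r = q = 0
    for rc, qc in zip(ref_aln, qry_aln):
        if rc != "-":
            r += 1
        if qc != "-":
            q += 1
        if rc != "-" and r == ref_pos:
            return q if qc != "-" else None
    return None
```

In practice a substitution matrix and affine gap penalties would normally be used for protein sequences; the fixed scores here are chosen only to keep the sketch short.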

Furthermore, an "E1 polypeptide" or an "E2 polypeptide" as defined herein is
not limited to a polypeptide having the exact sequence depicted in the Figures.
Indeed, the HCV genome is in a state of constant flux in vivo and contains several
variable domains which exhibit relatively high degrees of variability between isolates.
A number of conserved and variable regions are known between these strains and, in
general, the amino acid sequences of epitopes derived from these regions will have a
high degree of sequence homology, e.g., amino acid sequence homology of more than
30%, preferably more than 40%, more than 60%, and even more than 80-90%
homology, when the two sequences are aligned. It is readily apparent that the terms
encompass E1 and E2 polypeptides from any of the various HCV strains and isolates
including isolates having any of the 6 genotypes of HCV described in Simmonds et
al., J. Gen. Virol. (1993) 74:2391-2399 (e.g., strains 1, 2, 3, 4 etc.), as well as newly
identified isolates, and subtypes of these isolates, such as HCV1a, HCV1b etc.

Thus, for example, the term "E1" or "E2" polypeptide refers to native E1 or E2
sequences from any of the various HCV strains, as well as analogs, muteins and
immunogenic fragments, as defined further below. The complete genotypes of many
of these strains are known. See, e.g., U.S. Patent No. 6,150,087 and GenBank
Accession Nos. AJ238800 and AJ238799.

Additionally, the terms "E1 polypeptide" and "E2 polypeptide" encompass
proteins which include modifications to the native sequence, such as internal
deletions, additions and substitutions (generally conservative in nature). These
modifications may be deliberate, as through site-directed mutagenesis, or may be
accidental, such as through naturally occurring mutational events. All of these
modifications are encompassed in the present invention so long as the modified E1
and E2 polypeptides function for their intended purpose. Thus, for example, if the E1
and/or E2 polypeptides are to be used in vaccine compositions, the modifications must
be such that immunological activity (i.e., the ability to elicit a humoral or cellular
immune response to the polypeptide) is not lost.

By "E1E2" complex is meant a protein containing at least one E1 polypeptide
and at least one E2 polypeptide, as described above. Such a complex may also
include all or a portion of the p7 region which occurs immediately adjacent to the
C-terminus of E2. As shown in Figures 1 and 3A-3C, the p7 region is found at positions
747-809, numbered relative to the full-length HCV-1 polyprotein (amino acid
positions 575-637 of SEQ ID NO:4). A representative E1E2 complex which includes
the p7 protein is termed "E1E2 809" herein.

The mode of association of E1 and E2 in an E1E2 complex is immaterial. The
E1 and E2 polypeptides may be associated through non-covalent interactions such as
through electrostatic forces, or by covalent bonds. For example, the E1E2
polypeptides of the present invention may be in the form of a fusion protein which
includes an immunogenic E1 polypeptide and an immunogenic E2 polypeptide, as
defined above. The fusion may be expressed from a polynucleotide encoding an E1E2
chimera. Alternatively, E1E2 complexes may form spontaneously simply by mixing
E1 and E2 proteins which have been produced individually. Similarly, when
co-expressed and secreted into media, the E1 and E2 proteins can form a complex
spontaneously. Thus, the term encompasses E1E2 complexes (also called aggregates)
that spontaneously form upon purification of E1 and/or E2. Such aggregates may
include one or more E1 monomers in association with one or more E2 monomers.
The number of E1 and E2 monomers present need not be equal so long as at least one
E1 monomer and one E2 monomer are present. The presence of an E1E2
complex is readily determined using standard protein detection techniques such as
polyacrylamide gel electrophoresis and immunological techniques such as
immunoprecipitation.

The terms "analog" and "mutein" refer to biologically active derivatives of the
reference molecule, or fragments of such derivatives, that retain desired activity, such
as the ability to stimulate a cell-mediated immune response, as defined below. In
general, the term "analog" refers to compounds having a native polypeptide sequence
and structure with one or more amino acid additions, substitutions (generally
conservative in nature) and/or deletions, relative to the native molecule, so long as the
modifications do not destroy immunogenic activity. The term "mutein" refers to
peptides having one or more peptide mimics ("peptoids"), such as those described in
International Publication No. WO 91/04282. Preferably, the analog or mutein has at
least the same immunoactivity as the native molecule. Methods for making
polypeptide analogs and muteins are known in the art and are described further below.

As explained above, analogs generally include substitutions that are
conservative in nature, i.e., those substitutions that take place within a family of amino
acids that are related in their side chains. Specifically, amino acids are generally
divided into four families: (1) acidic - aspartate and glutamate; (2) basic - lysine,
arginine, histidine; (3) non-polar - alanine, valine, leucine, isoleucine, proline,
phenylalanine, methionine, tryptophan; and (4) uncharged polar - glycine, asparagine,
glutamine, cysteine, serine, threonine, tyrosine. Phenylalanine, tryptophan, and
tyrosine are sometimes classified as aromatic amino acids. For example, it is
reasonably predictable that an isolated replacement of leucine with isoleucine or
valine, an aspartate with a glutamate, a threonine with a serine, or a similar
conservative replacement of an amino acid with a structurally related amino acid, will
not have a major effect on the biological activity. For example, the polypeptide of
interest may include up to about 5-10 conservative or non-conservative amino acid
substitutions, or even up to about 15-25 conservative or non-conservative amino acid
substitutions, or any integer between 5-25, so long as the desired function of the
molecule remains intact. One of skill in the art may readily determine regions of the
molecule of interest that can tolerate change by reference to Hopp/Woods and
Kyte-Doolittle plots, well known in the art.
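
The four families described above lend themselves to a simple membership test. The following Python sketch is illustrative only; the names FAMILIES, family_of, is_conservative and count_substitutions are hypothetical. It classifies a substitution as conservative when both residues fall in the same family, and tallies substitutions between a native sequence and an equal-length variant. Whether a given variant retains the desired function must still be determined experimentally.

```python
# Side-chain families as listed in the text, in one-letter codes.
FAMILIES = {
    "acidic": set("DE"),
    "basic": set("KRH"),
    "non-polar": set("AVLIPFMW"),
    "uncharged polar": set("GNQCSTY"),
}


def family_of(residue):
    """Return the name of the family containing the given one-letter residue."""
    for name, members in FAMILIES.items():
        if residue in members:
            return name
    raise ValueError(f"unknown residue: {residue!r}")


def is_conservative(a, b):
    """True if replacing residue a with residue b stays within one family."""
    return family_of(a) == family_of(b)


def count_substitutions(native, variant):
    """Count (conservative, non_conservative) substitutions in equal-length sequences."""
    if len(native) != len(variant):
        raise ValueError("sequences must have equal length")
    conservative = non_conservative = 0
    for a, b in zip(native, variant):
        if a == b:
            continue
        if is_conservative(a, b):
            conservative += 1
        else:
            non_conservative += 1
    return conservative, non_conservative
```

The replacements named in the text (leucine with isoleucine or valine, aspartate with glutamate, threonine with serine) are all classified as conservative under this scheme.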

By "modified NS3" is meant an NS3 polypeptide with a modification such that
protease activity of the NS3 polypeptide is disrupted. The modification can include
one or more amino acid additions, substitutions (generally non-conservative in nature)
and/or deletions, relative to the native molecule, wherein the protease activity of the
NS3 polypeptide is disrupted. Methods of measuring protease activity are discussed
further below.

By "fragment" is intended a polypeptide consisting of only a part of the intact
full-length polypeptide sequence and structure. The fragment can include a
C-terminal deletion and/or an N-terminal deletion of the native polypeptide. An
"immunogenic fragment" of a particular HCV protein will generally include at least
about 5-10 contiguous amino acid residues of the full-length molecule, preferably at
least about 15-25 contiguous amino acid residues of the full-length molecule, and
most preferably at least about 20-50 or more contiguous amino acid residues of the
full-length molecule, that define an epitope, or any integer between 5 amino acids and
the full-length sequence, provided that the fragment in question retains immunogenic
activity, as measured by the assays described herein.
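
Candidate fragments of a given length can be enumerated by sliding a window along the full-length sequence. The following Python sketch is illustrative only; the function name sliding_fragments and its parameters are hypothetical. It lists contiguous fragments but says nothing about immunogenicity, which must be established by assay.

```python
def sliding_fragments(sequence, length, step=1):
    """Yield (start, fragment) pairs of contiguous residues; start is 1-based."""
    if length < 1 or step < 1:
        raise ValueError("length and step must be positive")
    # The last window starts where exactly `length` residues remain.
    for i in range(0, len(sequence) - length + 1, step):
        yield i + 1, sequence[i:i + length]
```

A step smaller than the fragment length produces overlapping fragments, so that a short epitope is unlikely to be split across every candidate.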

The term "epitope" as used herein refers to a sequence of at least about 3 to 5,
preferably about 5 to 10 or 15, and not more than about 1,000 amino acids (or any
integer therebetween), which define a sequence that, by itself or as part of a larger
sequence, binds to an antibody generated in response to such sequence. There is no
critical upper limit to the length of the fragment, which may comprise nearly the
full length of the protein sequence, or even a fusion protein comprising two or more
epitopes from the HCV polyprotein. An epitope for use in the subject invention is not
limited to a polypeptide having the exact sequence of the portion of the parent protein
from which it is derived. Indeed, viral genomes are in a state of constant flux and
contain several variable domains which exhibit relatively high degrees of variability
between isolates. Thus the term "epitope" encompasses sequences identical to the
native sequence, as well as modifications to the native sequence, such as deletions,
additions and substitutions (generally conservative in nature).

Regions of a given polypeptide that include an epitope can be identified using
any number of epitope mapping techniques, well known in the art. See, e.g., Epitope
Mapping Protocols in Methods in Molecular Biology, Vol. 66 (Glenn E. Morris, Ed.,
1996) Humana Press, Totowa, New Jersey. For example, linear epitopes may be
determined by, e.g., concurrently synthesizing large numbers of peptides on solid
supports, the peptides corresponding to portions of the protein molecule, and reacting
the peptides with antibodies while the peptides are still attached to the supports. Such
techniques are known in the art and described in, e.g., U.S. Patent No. 4,708,871;
Geysen et al. (1984) Proc. Natl. Acad. Sci. USA 81:3998-4002; Geysen et al.
(1986) Molec. Immunol. 23:709-715. Similarly, conformational epitopes are readily
identified by determining spatial conformation of amino acids such as by, e.g., x-ray
crystallography and 2-dimensional nuclear magnetic resonance. See, e.g., Epitope
Mapping Protocols, supra. Antigenic regions of proteins can also be identified using
standard antigenicity and hydropathy plots, such as those calculated using, e.g., the
Omiga version 1.0 software program available from the Oxford Molecular Group.
This computer program employs the Hopp/Woods method, Hopp et al., Proc. Natl.
Acad. Sci. USA (1981) 78:3824-3828, for determining antigenicity profiles, and the
Kyte-Doolittle technique, Kyte et al., J. Mol. Biol. (1982) 157:105-132, for hydropathy
plots.
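
A hydropathy plot of the Kyte-Doolittle type is a sliding-window average of per-residue hydropathy values. The following Python sketch is a minimal illustration and is not the Omiga program; the function name hydropathy_profile and the default window of 9 residues are assumptions made for the example. The dictionary holds the published Kyte-Doolittle hydropathy index for each residue.

```python
# Kyte-Doolittle hydropathy index (positive values are hydrophobic).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def hydropathy_profile(sequence, window=9):
    """Return the mean hydropathy of each full window along the sequence."""
    if window < 1 or window > len(sequence):
        raise ValueError("window must be between 1 and the sequence length")
    values = [KYTE_DOOLITTLE[aa] for aa in sequence]
    return [
        sum(values[i:i + window]) / window
        for i in range(len(values) - window + 1)
    ]
```

Stretches with strongly negative averages are hydrophilic and more likely to be surface exposed, which is why such plots are used as a rough guide to candidate antigenic regions.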

For a description of various HCV epitopes, see, e.g., Chien et al., Proc. Natl.
Acad. Sci. USA (1992) 89:10011-10015; Chien et al., J. Gastroent. Hepatol. (1993)
8:S33-39; Chien et al., International Publication No. WO 93/00365; Chien, D.Y.,
International Publication No. WO 94/01778; and U.S. Patent Nos. 6,280,927 and
6,150,087.

As used herein, the term "conformational epitope" refers to a portion of a
full-length protein, or an analog or mutein thereof, having structural features native to the
amino acid sequence encoding the epitope within the full-length natural protein.
Native structural features include, but are not limited to, glycosylation and
three-dimensional structure. Preferably, a conformational epitope is produced
recombinantly and is expressed in a cell from which it is extractable under conditions
which preserve its desired structural features, e.g., without denaturation of the epitope.
Such cells include bacteria, yeast, insect, and mammalian cells. Expression and
isolation of recombinant conformational epitopes from the HCV polyprotein are
described in, e.g., International Publication Nos. WO 96/04301, WO 94/01778, WO
95/33053, WO 92/08734.

As used herein the term "T-cell epitope" refers to a feature of a peptide
structure which is capable of inducing T-cell immunity towards the peptide structure
or an associated hapten. T-cell epitopes generally comprise linear peptide
determinants that assume extended conformations within the peptide-binding cleft of
MHC molecules (Unanue et al., Science (1987) 236:551-557). Conversion of
polypeptides to MHC class II-associated linear peptide determinants (generally
between 5-14 amino acids in length) is termed "antigen processing", which is carried
out by antigen presenting cells (APCs). More particularly, a T-cell epitope is defined
by local features of a short peptide structure, such as primary amino acid sequence
properties involving charge and hydrophobicity, and certain types of secondary
structure, such as helicity, that do not depend on the folding of the entire polypeptide.
Further, it is believed that short peptides capable of recognition by helper T-cells are
generally amphipathic structures comprising a hydrophobic side (for interaction with
the MHC molecule) and a hydrophilic side (for interacting with the T-cell receptor)
(Margalit et al., Computer Prediction of T-cell Epitopes, New Generation Vaccines,
Marcel-Dekker, Inc., ed. G.C. Woodrow et al., (1990) pp. 109-116), and further that the
amphipathic structures have an α-helical configuration (see, e.g., Spouge et al., J.
Immunol. (1987) 138:204-212; Berkower et al., J. Immunol. (1986) 136:2498-2503).

Hence, segments of proteins that include T-cell epitopes can be readily
predicted using numerous computer programs. (See, e.g., Margalit et al., Computer
Prediction of T-cell Epitopes, New Generation Vaccines, Marcel-Dekker, Inc., ed. G.C.
Woodrow et al., (1990) pp. 109-116). Such programs generally compare the amino
acid sequence of a peptide to sequences known to induce a T-cell response, and search
for patterns of amino acids which are believed to be required for a T-cell epitope.

An "immunological response" to an HCV antigen (including both polypeptide
and polynucleotides encoding polypeptides that are expressed in vivo) or composition
is the development in a subject of a humoral and/or a cellular immune response to
molecules present in the composition of interest. For purposes of the present
invention, a "humoral immune response" refers to an immune response mediated by
antibody molecules, while a "cellular immune response" is one mediated by
T-lymphocytes and/or other white blood cells. One important aspect of cellular
immunity involves an antigen-specific response by cytolytic T-cells ("CTLs"). CTLs
have specificity for peptide antigens that are presented in association with proteins
encoded by the major histocompatibility complex (MHC) and expressed on the
surfaces of cells. CTLs help induce and promote the intracellular destruction of
intracellular microbes, or the lysis of cells infected with such microbes. Another
aspect of cellular immunity involves an antigen-specific response by helper T-cells.
Helper T-cells act to help stimulate the function, and focus the activity of, nonspecific
effector cells against cells displaying peptide antigens in association with MHC
molecules on their surface. A "cellular immune response" also refers to the
production of cytokines, chemokines and other such molecules produced by activated
T-cells and/or other white blood cells, including those derived from CD4+ and CD8+
T-cells.

A composition or vaccine that elicits a cellular immune response may serve to
sensitize a vertebrate subject by the presentation of antigen in association with MHC
molecules at the cell surface. The cell-mediated immune response is directed at, or
near, cells presenting antigen at their surface. In addition, antigen-specific
T-lymphocytes can be generated to allow for the future protection of an immunized host.

The ability of a particular antigen to stimulate a cell-mediated immunological
response may be determined by a number of assays, such as by lymphoproliferation
(lymphocyte activation) assays, CTL cytotoxic cell assays, or by assaying for
T-lymphocytes specific for the antigen in a sensitized subject. Such assays are well
known in the art. See, e.g., Erickson et al., J. Immunol. (1993) 151:4189-4199; Doe et
al., Eur. J. Immunol. (1994) 24:2369-2376; and the examples below.

Thus, an immunological response as used herein may be one which stimulates
the production of CTLs, and/or the production or activation of helper T-cells. The
antigen of interest may also elicit an antibody-mediated immune response. Hence, an
immunological response may include one or more of the following effects: the
production of antibodies by B-cells; and/or the activation of suppressor T-cells and/or
γδ T-cells directed specifically to an antigen or antigens present in the composition or
vaccine of interest. These responses may serve to neutralize infectivity, and/or
mediate antibody-complement, or antibody dependent cell cytotoxicity (ADCC) to
provide protection or alleviation of symptoms to an immunized host. Such responses
can be determined using standard immunoassays and neutralization assays, well
known in the art.

By "equivalent antigenic determinant" is meant an antigenic determinant from
different sub-species or strains of HCV, such as from strains 1, 2, 3, etc., of HCV,
which antigenic determinants are not necessarily identical due to sequence variation,
but which occur in equivalent positions in the HCV sequence in question. In general,
the amino acid sequences of equivalent antigenic determinants will have a high degree
of sequence homology, e.g., amino acid sequence homology of more than 30%,
usually more than 40%, such as more than 60%, and even more than 80-90%
homology, when the two sequences are aligned.

A "coding sequence" or a sequence which "encodes" a selected polypeptide, is
a nucleic acid molecule which is transcribed (in the case of DNA) and translated (in
the case of mRNA) into a polypeptide in vitro or in vivo when placed under the
control of appropriate regulatory sequences. The boundaries of the coding sequence
are determined by a start codon at the 5' (amino) terminus and a translation stop codon
at the 3' (carboxy) terminus. A transcription termination sequence may be located 3'
to the coding sequence.

A "nucleic acid" molecule or "polynucleotide" can include both double- and
single-stranded sequences and refers to, but is not limited to, cDNA from viral,
procaryotic or eucaryotic mRNA, genomic DNA sequences from viral (e.g., DNA
viruses and retroviruses) or procaryotic DNA, and especially synthetic DNA
sequences. The term also captures sequences that include any of the known base
analogs of DNA and RNA.

An "HCV polynucleotide" is a polynucleotide that encodes an HCV 
polypeptide, as defined above. 

"Operably linked" refers to an arrangement of elements wherein the
components so described are configured so as to perform their desired function. Thus,
a given promoter operably linked to a coding sequence is capable of effecting the
expression of the coding sequence when the proper transcription factors, etc., are
present. The promoter need not be contiguous with the coding sequence, so long as it
functions to direct the expression thereof. Thus, for example, intervening untranslated
yet transcribed sequences can be present between the promoter sequence and the
coding sequence, as can transcribed introns, and the promoter sequence can still be
considered "operably linked" to the coding sequence.

"Recombinant" as used herein to describe a nucleic acid molecule means a
polynucleotide of genomic, cDNA, viral, semisynthetic, or synthetic origin which, by
virtue of its origin or manipulation, is not associated with all or a portion of the
polynucleotide with which it is associated in nature. The term "recombinant" as used
with respect to a protein or polypeptide means a polypeptide produced by expression
of a recombinant polynucleotide. In general, the gene of interest is cloned and then
expressed in transformed organisms, as described further below. The host organism
expresses the foreign gene to produce the protein under expression conditions.

A "control element" refers to a polynucleotide sequence which aids in the
expression of a coding sequence to which it is linked. The term includes promoters,
transcription termination sequences, upstream regulatory domains, polyadenylation
signals, untranslated regions, including 5'-UTRs and 3'-UTRs and, when appropriate,
leader sequences and enhancers, which collectively provide for the transcription and
translation of a coding sequence in a host cell.

A "promoter" as used herein is a DNA regulatory region capable of binding
RNA polymerase in a host cell and initiating transcription of a downstream (3'
direction) coding sequence operably linked thereto. For purposes of the present
invention, a promoter sequence includes the minimum number of bases or elements
necessary to initiate transcription of a gene of interest at levels detectable above
background. Within the promoter sequence is a transcription initiation site, as well as
protein binding domains (consensus sequences) responsible for the binding of RNA
polymerase. Eucaryotic promoters will often, but not always, contain "TATA" boxes
and "CAT" boxes.

A control sequence "directs the transcription" of a coding sequence in a cell
when RNA polymerase will bind the promoter sequence and transcribe the coding
sequence into mRNA, which is then translated into the polypeptide encoded by the
coding sequence.

"Expression cassette" or "expression construct" refers to an assembly which is
capable of directing the expression of the sequence(s) or gene(s) of interest. The
expression cassette includes control elements, as described above, such as a promoter
which is operably linked to (so as to direct transcription of) the sequence(s) or gene(s)
of interest, and often includes a polyadenylation sequence as well. Within certain
embodiments of the invention, the expression cassette described herein may be
contained within a plasmid construct. In addition to the components of the expression
cassette, the plasmid construct may also include one or more selectable markers, a
signal which allows the plasmid construct to exist as single-stranded DNA (e.g., an
M13 origin of replication), at least one multiple cloning site, and a "mammalian"
origin of replication (e.g., an SV40 or adenovirus origin of replication).

"Transformation," as used herein, refers to the insertion of an exogenous
polynucleotide into a host cell, irrespective of the method used for insertion: for
example, transformation by direct uptake, transfection, infection, and the like. For
particular methods of transfection, see further below. The exogenous polynucleotide
may be maintained as a nonintegrated vector, for example, an episome, or
alternatively, may be integrated into the host genome.

A "host cell" is a cell which has been transformed, or is capable of 
transformation, by an exogenous DNA sequence. 

By "isolated" is meant, when referring to a polypeptide, that the indicated
molecule is separate and discrete from the whole organism with which the molecule is
found in nature or is present in the substantial absence of other biological
macromolecules of the same type. The term "isolated" with respect to a polynucleotide is a
nucleic acid molecule devoid, in whole or part, of sequences normally associated with
it in nature; or a sequence, as it exists in nature, but having heterologous sequences in
association therewith; or a molecule disassociated from the chromosome.

The term "purified" as used herein preferably means at least 75% by weight,
more preferably at least 85% by weight, more preferably still at least 95% by weight,
and most preferably at least 98% by weight, of biological macromolecules of the same
type are present.

"Homology" refers to the percent identity between two polynucleotide or two
polypeptide moieties. Two DNA, or two polypeptide sequences are "substantially
homologous" to each other when the sequences exhibit at least about 50%, preferably
at least about 75%, more preferably at least about 80%-85%, preferably at least about
90%, and most preferably at least about 95%-98%, or more, sequence identity over a
defined length of the molecules. As used herein, substantially homologous also refers
to sequences showing complete identity to the specified DNA or polypeptide
sequence.

In general, "identity" refers to an exact nucleotide-to-nucleotide or amino
acid-to-amino acid correspondence of two polynucleotides or polypeptide sequences,
respectively. Percent identity can be determined by a direct comparison of the
sequence information between two molecules by aligning the sequences, counting the
exact number of matches between the two aligned sequences, dividing by the length of
the shorter sequence, and multiplying the result by 100. Readily available computer
programs can be used to aid in the analysis, such as ALIGN, Dayhoff, M.O. in Atlas of
Protein Sequence and Structure, M.O. Dayhoff ed., 5 Suppl. 3:353-358, National
Biomedical Research Foundation, Washington, DC, which adapts the local homology
algorithm of Smith and Waterman, Advances in Appl. Math. 2:482-489, 1981, for
peptide analysis. Programs for determining nucleotide sequence identity are available
in the Wisconsin Sequence Analysis Package, Version 8 (available from Genetics
Computer Group, Madison, WI), for example, the BESTFIT, FASTA and GAP
programs, which also rely on the Smith and Waterman algorithm. These programs are
readily utilized with the default parameters recommended by the manufacturer and
described in the Wisconsin Sequence Analysis Package referred to above. For
example, percent identity of a particular nucleotide sequence to a reference sequence
can be determined using the homology algorithm of Smith and Waterman with a
default scoring table and a gap penalty of six nucleotide positions.
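
The direct-comparison calculation described above (count the exact matches between two aligned sequences, divide by the length of the shorter sequence, multiply by 100) can be sketched as follows. This is a minimal illustration and not the ALIGN or Wisconsin Package implementation; the function name `percent_identity` and the use of "-" as the gap symbol are assumptions, and the two inputs are assumed to have been aligned already.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity of two pre-aligned sequences.

    Matches are counted column by column (gap columns never count), and the
    count is divided by the ungapped length of the shorter sequence.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    matches = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )
    length_a = sum(1 for x in aligned_a if x != gap)
    length_b = sum(1 for y in aligned_b if y != gap)
    shorter = min(length_a, length_b)
    if shorter == 0:
        raise ValueError("sequences must contain at least one residue")
    return 100.0 * matches / shorter


# Four matching columns; the shorter sequence has five residues.
print(percent_identity("ACGT-A", "ACCTGA"))  # 80.0
```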



WO 2004/039950 PCT/US2003/033610

Another method of establishing percent identity in the context of the present
invention is to use the MPSRCH package of programs copyrighted by the University
of Edinburgh, developed by John F. Collins and Shane S. Sturrok, and distributed by
IntelliGenetics, Inc. (Mountain View, CA). From this suite of packages the
Smith-Waterman algorithm can be employed where default parameters are used for the
scoring table (for example, gap open penalty of 12, gap extension penalty of one, and
a gap of six). From the data generated the "Match" value reflects "sequence identity."
Other suitable programs for calculating the percent identity or similarity between
sequences are generally known in the art, for example, another alignment program is
BLAST, used with default parameters. For example, BLASTN and BLASTP can be
used using the following default parameters: genetic code = standard; filter = none;
strand = both; cutoff = 60; expect = 10; Matrix = BLOSUM62; Descriptions = 50
sequences; sort by = HIGH SCORE; Databases = non-redundant, GenBank + EMBL +
DDBJ + PDB + GenBank CDS translations + Swiss protein + Spupdate + PIR.
Details of these programs can be found at the following internet address:
http://www.ncbi.nlm.gov/cgi-bin/BLAST.
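
For orientation, the Smith-Waterman local alignment score with separate gap open and gap extension penalties can be sketched as below. This is not the MPSRCH, Wisconsin Package, or BLAST code. The gap open penalty of 12 and gap extension penalty of one follow the example values given above; the simple match and mismatch scores stand in for a real scoring table such as BLOSUM62 and are assumptions, as is the convention that a gap of length k costs gap_open + (k - 1) * gap_extend.

```python
def smith_waterman_score(a, b, match=5, mismatch=-4, gap_open=12, gap_extend=1):
    """Best local alignment score between a and b with affine gap penalties.

    Only the score is returned (no traceback). A gap of length k costs
    gap_open + (k - 1) * gap_extend under the convention assumed here.
    """
    neg_inf = float("-inf")
    cols = len(b) + 1
    h_prev = [0] * cols        # best score of an alignment ending at (i-1, j)
    f_prev = [neg_inf] * cols  # best score ending with a gap in b at (i-1, j)
    best = 0
    for i in range(1, len(a) + 1):
        h_cur = [0] * cols
        f_cur = [neg_inf] * cols
        e = neg_inf            # best score ending with a gap in a at (i, j)
        for j in range(1, cols):
            e = max(h_cur[j - 1] - gap_open, e - gap_extend)
            f_cur[j] = max(h_prev[j] - gap_open, f_prev[j] - gap_extend)
            pair = match if a[i - 1] == b[j - 1] else mismatch
            # A local alignment may restart anywhere, hence the floor of 0.
            h_cur[j] = max(0, h_prev[j - 1] + pair, e, f_cur[j])
            best = max(best, h_cur[j])
        h_prev, f_prev = h_cur, f_cur
    return best


# Twelve matches (60) minus one gap of length three (12 + 2 = 14).
print(smith_waterman_score("AAAAAATTTCCCCCC", "AAAAAACCCCCC"))  # 46
```

In practice an established alignment package with its documented default parameters would be used; the sketch only shows how the two gap penalties enter the recurrence.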

Alternatively, homology can be determined by hybridization of
polynucleotides under conditions which form stable duplexes between homologous
regions, followed by digestion with single-stranded-specific nuclease(s), and size
determination of the digested fragments. DNA sequences that are substantially
homologous can be identified in a Southern hybridization experiment under, for
example, stringent conditions, as defined for that particular system. Defining
appropriate hybridization conditions is within the skill of the art. See, e.g., Sambrook
et al., supra; DNA Cloning, supra; Nucleic Acid Hybridization, supra.

By "nucleic acid immunization" is meant the introduction of a nucleic acid
molecule encoding one or more selected antigens into a host cell, for the in vivo
expression of the antigen or antigens. The nucleic acid molecule can be introduced
directly into the recipient subject, such as by injection, inhalation, oral, intranasal and
mucosal administration, or the like, or can be introduced ex vivo, into cells which have
been removed from the host. In the latter case, the transformed cells are reintroduced
into the subject where an immune response can be mounted against the antigen
encoded by the nucleic acid molecule.

As used herein, "treatment" refers to any of (i) the prevention of infection or
reinfection, as in a traditional vaccine, (ii) the reduction or elimination of symptoms,
and (iii) the substantial or complete elimination of the pathogen in question.
Treatment may be effected prophylactically (prior to infection) or therapeutically
(following infection).

By "vertebrate subject" is meant any member of the subphylum cordata,
including, without limitation, humans and other primates, including non-human
primates such as chimpanzees and other apes and monkey species; farm animals such
as cattle, sheep, pigs, goats and horses; domestic mammals such as dogs and cats;
laboratory animals including rodents such as mice, rats and guinea pigs; birds,
including domestic, wild and game birds such as chickens, turkeys and other
gallinaceous birds, ducks, geese, and the like. The term does not denote a particular
age. Thus, both adult and newborn individuals are intended to be covered. The
invention described herein is intended for use in any of the above vertebrate species,
since the immune systems of all of these vertebrates operate similarly.

II. Modes of Carrying out the Invention

Before describing the present invention in detail, it is to be understood that this
invention is not limited to particular formulations or process parameters as such may,
of course, vary. It is also to be understood that the terminology used herein is for the
purpose of describing particular embodiments of the invention only, and is not
intended to be limiting.

Although a number of compositions and methods similar or equivalent to
those described herein can be used in the practice of the present invention, the
preferred materials and methods are described herein.

It is a discovery of the present invention that fusion proteins, combinations of
the individual components of these fusions, and polynucleotides encoding the same,
comprising an NS3, an NS4, and an NS5a polypeptide with or without a core
polypeptide, or an NS3, an NS4, an NS5a, and an NS5b polypeptide, with or without a
core polypeptide, of an HCV virus can be used to activate HCV-specific T cells, i.e., T
cells which recognize epitopes of these polypeptides.

The present invention also pertains to compositions comprising HCV
nonstructural fusion proteins and HCV E1E2 complexes, as well as compositions
comprising polynucleotides encoding the same or combinations of polypeptides and
polynucleotides.

The proteins, polynucleotides, compositions and combinations of the present
invention can be used to stimulate a cellular immune response, such as to activate
HCV-specific T cells, i.e., T cells which recognize epitopes of these polypeptides.
Activation of HCV-specific T cells provides both in vitro and in vivo model systems
for the development of HCV vaccines, particularly for identifying HCV polypeptide
epitopes associated with a response. The compositions can also be used to generate an
immune response against HCV in a mammal, particularly a CTL response for either
therapeutic or prophylactic purposes.

Fusion Proteins 

The genomes of HCV strains contain a single open reading frame of
approximately 9,000 to 12,000 nucleotides, which is transcribed into a polyprotein.
As shown in Figure 1 and the table below, an HCV polyprotein, upon cleavage,
produces at least ten distinct products, in the order of NH2-Core-E1-E2-p7-NS2-NS3-
NS4a-NS4b-NS5a-NS5b-COOH. The core polypeptide occurs at positions 1-191,
numbered relative to HCV-1 (see, Choo et al. (1991) Proc. Natl. Acad. Sci. USA
88:2451-2455, for the HCV-1 genome). This polypeptide is further processed to
produce an HCV polypeptide with approximately amino acids 1-173. The envelope
polypeptides, E1 and E2, occur at about positions 192-383 and 384-746, respectively.
The P7 domain is found at about positions 747-809. NS2 is an integral membrane
protein with proteolytic activity and is found at about positions 810-1026 of the
polyprotein. NS2, in combination with NS3 (found at about positions 1027-1657),
cleaves the NS2-NS3 scissile bond, which in turn generates the NS3 N-terminus and
releases a large polyprotein that includes both serine protease and RNA helicase
activities. The NS3 protease, found at about positions 1027-1207, serves to process
the remaining polyprotein. The helicase activity is found at about positions
1193-1657. NS3 liberates an NS3 cofactor (NS4a, found at about positions 1658-1711), two
proteins (NS4b, found at about positions 1712-1972, and NS5a, found at about
positions 1973-2420), and an RNA-dependent RNA polymerase (NS5b, found at about
positions 2421-3011). Completion of polyprotein maturation is initiated by
autocatalytic cleavage at the NS3-NS4a junction, catalyzed by the NS3 serine protease.



Domain          Approximate Boundaries*
C (core)        1-191
E1              192-383
E2              384-746
P7              747-809
NS2             810-1026
NS3             1027-1657
NS4a            1658-1711
NS4b            1712-1972
NS5a            1973-2420
NS5b            2421-3011

*Numbered relative to HCV-1. See, Choo et al. (1991) Proc. Natl. Acad. Sci.
USA 88:2451-2455.
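
As an aid to reading the table, the approximate boundaries can be held in a small lookup structure. The sketch below simply transcribes the table; the names `HCV1_DOMAINS`, `domain_at` and `domains_spanned` are hypothetical, and the boundaries remain approximate and are numbered relative to HCV-1.

```python
# Approximate domain boundaries transcribed from the table above
# (inclusive amino acid positions, numbered relative to HCV-1).
HCV1_DOMAINS = [
    ("C (core)", 1, 191),
    ("E1", 192, 383),
    ("E2", 384, 746),
    ("P7", 747, 809),
    ("NS2", 810, 1026),
    ("NS3", 1027, 1657),
    ("NS4a", 1658, 1711),
    ("NS4b", 1712, 1972),
    ("NS5a", 1973, 2420),
    ("NS5b", 2421, 3011),
]


def domain_at(position):
    """Return the domain containing a polyprotein position, or None."""
    for name, start, end in HCV1_DOMAINS:
        if start <= position <= end:
            return name
    return None


def domains_spanned(first, last):
    """Return, in polyprotein order, the domains overlapping first..last."""
    return [
        name for name, start, end in HCV1_DOMAINS
        if start <= last and end >= first
    ]


print(domain_at(1083))               # NS3
print(domains_spanned(1242, 3011))   # ['NS3', 'NS4a', 'NS4b', 'NS5a', 'NS5b']
```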

Fusion proteins for use in the compositions and methods, and polynucleotides
encoding therefor, include or encode an NS3 polypeptide, an NS4 (NS4a and/or
NS4b) polypeptide, an NS5a polypeptide and, optionally, an NS5b polypeptide. The
fusion proteins may or may not include all or part of the core region. In certain
embodiments, none of the core region is present in the compositions. The
nonstructural regions need not be in the order in which they naturally occur in the
native HCV polyprotein. Thus, for example, the NS5b polypeptide may be at the N-
and/or C-terminus of the fusion or may be found internally. These polypeptides may
be derived from the same HCV isolate, or from different strains and isolates,
including isolates having any of the various HCV genotypes, to provide increased
protection against a broad range of HCV genotypes. Additionally, polypeptides can
be selected based on the particular viral clades endemic in specific geographic
regions where vaccine compositions containing the fusions will be used. It is readily
apparent that the subject fusions provide an effective means of treating HCV
infection in a wide variety of contexts.

In one embodiment, the fusion protein of the present invention includes an
NS3 polypeptide that has been modified to inhibit protease activity, such that further
cleavage of the fusion is inhibited. The NS3 polypeptide can be modified by deletion
of all or a portion of the NS3 protease domain. Alternatively, proteolytic activity can
be inhibited by substitutions of amino acids within active regions of the protease
domain. Finally, additions of amino acids to active regions of the domain, such that
the catalytic site is modified, will also serve to inhibit proteolytic activity.

As explained above, the protease activity is found at about amino acid
positions 1027-1207, numbered relative to the full-length HCV-1 polyprotein (see,
Choo et al., Proc. Natl. Acad. Sci. USA (1991) 88:2451-2455), corresponding to
positions 2-182 of Figure 3. The structure of the NS3 protease and active site are
known. See, e.g., De Francesco et al., Antivir. Ther. (1998) 3:99-109; Koch et al.,
Biochemistry (2001) 40:631-640. Thus, deletions or modifications to the native
sequence will typically occur at or near the active site of the molecule. Particularly,
it is desirable to modify or make deletions to one or more amino acids occurring at
positions 1- or 2-182, preferably 1- or 2-170, or 1- or 2-155 of Figure 3. Preferred
modifications are to the catalytic triad at the active site of the protease, i.e., H, D
or S residues, in order to inactivate the protease. These residues occur at positions
1083, 1105 and 1165, respectively, numbered relative to the full-length HCV
polyprotein (positions 58, 80 and 140, respectively, of Figure 3). Such modifications
will suppress proteolytic cleavage while maintaining T-cell epitopes.
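
The two numbering schemes used above differ by a constant: polyprotein positions 1027-1207 correspond to positions 2-182 of Figure 3, an offset of 1025. The following sketch makes the conversion explicit; the names `FIG3_OFFSET`, `polyprotein_to_fig3`, `fig3_to_polyprotein` and `CATALYTIC_TRIAD` are hypothetical and are introduced only for illustration.

```python
# Polyprotein position 1027 corresponds to position 2 of Figure 3.
FIG3_OFFSET = 1027 - 2  # = 1025

# Catalytic triad residues, numbered relative to the full-length polyprotein.
CATALYTIC_TRIAD = {"H": 1083, "D": 1105, "S": 1165}


def polyprotein_to_fig3(position):
    """Convert a full-length polyprotein position to Figure 3 numbering."""
    return position - FIG3_OFFSET


def fig3_to_polyprotein(position):
    """Convert a Figure 3 position to full-length polyprotein numbering."""
    return position + FIG3_OFFSET


for residue, position in CATALYTIC_TRIAD.items():
    print(residue, position, polyprotein_to_fig3(position))
# H 1083 58
# D 1105 80
# S 1165 140
```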

One of skill in the art can readily determine portions of the NS3 protease to delete in
order to disrupt activity. The presence or absence of activity can be determined using
methods known to those of skill in the art.

For example, protease activity or lack thereof may be determined using assays
well known in the art. See, e.g., Takeshita et al., Anal. Biochem. (1997) 247:242-
246; Kakiuchi et al., J. Biochem. (1997) 122:749-755; Sali et al., Biochemistry
(1998) 37:3392-3401; Cho et al., J. Virol. Meth. (1998) 72:109-115; Cerretani et al.,
Anal. Biochem. (1999) 266:192-197; Zhang et al., Anal. Biochem. (1999) 270:268-
275; Kakiuchi et al., J. Virol. Meth. (1999) 80:77-84; Fowler et al., J. Biomol.
Screen. (2000) 5:153-158; and Kim et al., Anal. Biochem. (2000) 284:42-48.

The NS3, NS4, NS5a, and NS5b polypeptides present in the various fusions
described above can either be full-length polypeptides or portions of NS3, NS4
(NS4a and/or NS4b), NS5a, and NS5b polypeptides. The portions of NS3, NS4,
NS5a, and NS5b polypeptides making up the fusion protein preferably comprise at
least one epitope, which is recognized by a T cell receptor on an activated T cell,
such as 2152-HEYPVGSQL-2160 (SEQ ID NO:1) and/or
2224-AELEEANLLWRQEMG-2238 (SEQ ID NO:2). Epitopes of NS3, NS4 (NS4a and
NS4b), NS5a, NS5b, NS3NS4NS5a, and NS3NS4NS5aNS5b can be identified by
several methods. For example, NS3, NS4, NS5a, NS5b polypeptides or fusion
proteins comprising any combination of the above, can be isolated, for example, by
immunoaffinity purification using a monoclonal antibody for the polypeptide or
protein. The isolated protein sequence can then be screened by preparing a series of
short peptides by proteolytic cleavage of the purified protein, which together span the
entire protein sequence. By starting with, for example, 100-mer polypeptides, each
polypeptide can be tested for the presence of epitopes recognized by a T-cell receptor
on an HCV-activated T cell. Progressively smaller and overlapping fragments can
then be tested from an identified 100-mer to map the epitope of interest.
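
The bookkeeping behind such a mapping scheme, i.e., enumerating overlapping fragments of a chosen length and tracking their polyprotein positions, can be sketched as follows. The function name `overlapping_fragments`, the fragment length, and the step size are assumptions made for the example; the text above does not prescribe particular lengths or overlaps beyond the initial 100-mers.

```python
def overlapping_fragments(sequence, length, step, first_position=1):
    """List (start, end, fragment) tuples of overlapping fragments.

    start and end are inclusive positions, counted from first_position so
    that fragments can be reported in polyprotein numbering. A final
    fragment is added if needed so the C-terminal end is always covered.
    """
    if length <= 0 or step <= 0:
        raise ValueError("length and step must be positive")
    last_start = max(len(sequence) - length, 0)
    starts = list(range(0, last_start + 1, step))
    if starts[-1] != last_start:
        starts.append(last_start)
    fragments = []
    for start in starts:
        fragment = sequence[start:start + length]
        fragments.append(
            (first_position + start,
             first_position + start + len(fragment) - 1,
             fragment)
        )
    return fragments


# SEQ ID NO:2 (positions 2224-2238) cut into 9-mers offset by three residues.
for start, end, fragment in overlapping_fragments(
        "AELEEANLLWRQEMG", length=9, step=3, first_position=2224):
    print(start, end, fragment)
# 2224 2232 AELEEANLL
# 2227 2235 EEANLLWRQ
# 2230 2238 NLLWRQEMG
```

Each fragment would then be tested in an assay such as those described below, and the procedure repeated with shorter fragments on any positive region.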

Epitopes recognized by a T-cell receptor on an HCV-activated T cell can be
identified by, for example, 51Cr release assay or by lymphoproliferation assay (see the
examples). In a 51Cr release assay, target cells can be constructed that display the
epitope of interest by cloning a polynucleotide encoding the epitope into an
expression vector and transforming the expression vector into the target cells.
HCV-specific CD8+ T cells will lyse target cells displaying, for example, an NS3, NS4,
NS5a, NS5b, NS3NS4NS5a, or NS3NS4NS5aNS5b epitope and will not lyse cells
that do not display such an epitope. In a lymphoproliferation assay, HCV-activated
CD4+ T cells will proliferate when cultured with, for example, an NS3, NS4, NS5a,
NS5b, NS3NS4NS5a, or NS3NS4NS5aNS5b epitopic peptide, but not in the absence
of an HCV epitopic peptide.

NS3, NS4, NS5a, and NS5b polypeptides can occur in any order in the fusion
protein. If desired, at least 2, 3, 4, 5, 6, 7, 8, 9, or 10 or more of one or more of the
polypeptides may occur in the fusion protein. Multiple viral strains of HCV occur,
and NS3, NS4, NS5a, and NS5b polypeptides of any of these strains can be used in a
fusion protein. A representative fusion protein for use in the present invention is
shown in Figures 5A-5J. The depicted sequence includes amino acids 1242-3011 of
the HCV polyprotein (representing polypeptides from NS3, NS4, NS5a and NS5b)
with amino acids 1-121 of the HCV polyprotein (representing a polypeptide from the
core region) fused to the C-terminus of NS5b. This numbering is relative to the
HCV-1 polyprotein.

Nucleic acid and amino acid sequences of a number of HCV strains and
isolates, including nucleic acid and amino acid sequences of NS3, NS4, NS5a, NS5b
genes and polypeptides have been determined. For example, isolate HCV J1.1 is
described in Kubo et al. (1989) Japan. Nucl. Acids Res. 17:10367-10372; Takeuchi
et al. (1990) Gene 91:287-291; Takeuchi et al. (1990) J. Gen. Virol. 71:3027-3033;
and Takeuchi et al. (1990) Nucl. Acids Res. 18:4626. The complete coding
sequences of two independent isolates, HCV-J and BK, are described by Kato et al.
(1990) Proc. Natl. Acad. Sci. USA 87:9524-9528 and Takamizawa et al. (1991) J.
Virol. 65:1105-1113, respectively.

Publications that describe HCV-1 isolates include Choo et al. (1990) Brit.
Med. Bull. 46:423-441; Choo et al. (1991) Proc. Natl. Acad. Sci. USA 88:2451-2455
and Han et al. (1991) Proc. Natl. Acad. Sci. USA 88:1711-1715. HCV isolates
HC-J1 and HC-J4 are described in Okamoto et al. (1991) Japan J. Exp. Med.
60:167-177. HCV isolates HCT 18, HCT 23, Th, HCT 27, EC1 and EC10 are
described in Weiner et al. (1991) Virol. 180:842-848. HCV isolates Pt-1, HCV-K1
and HCV-K2 are described in Enomoto et al. (1990) Biochem. Biophys. Res.
Commun. 170:1021-1025. HCV isolates A, C, D & E are described in
Tsukiyama-Kohara et al. (1991) Virus Genes 5:243-254.

Each of the NS3, NS4, NS5a, and NS5b components of a fusion protein can
be obtained from the same HCV strain or isolate or from different HCV strains or
isolates. For example, in a fusion protein comprising HCV polypeptides, the NS3
polypeptide can be derived from a first strain of HCV, and the NS4 and NS5a
polypeptides can be derived from a second strain of HCV. Alternatively, the NS4
polypeptide can be derived from a first strain of HCV, and the NS3 and NS5a
polypeptides can be derived from a second strain of HCV. Optionally, the NS5a
polypeptide can be derived from a first strain of HCV, and the NS3 and NS4
polypeptides can be derived from a second strain of HCV. NS3, NS4 and NS5a
polypeptides that are each derived from different HCV strains can also be used in an
HCV fusion protein. Similarly, in a fusion protein comprising NS5b, at least one of
the NS3, NS4, NS5a, and NS5b polypeptides can be derived from a different HCV
strain than the other polypeptides. Optionally, NS3, NS4, NS5a, and NS5b
polypeptides that are each derived from different HCV strains can also be used in an
NS3NS4NS5aNS5b fusion protein.

In addition to NS3, NS4a, NS4b, NS5a and NS5b, the fusion proteins can
contain other polypeptides derived from the HCV polyprotein. For example, it may
be desirable to include polypeptides derived from the core region of the HCV
polyprotein. This region occurs at amino acid positions 1-191 of the HCV
polyprotein, numbered relative to HCV-1. Either the full-length protein, fragments
thereof, such as amino acids 1-150, e.g., amino acids 1-130, 1-120, for example,
amino acids 1-121, 1-122, 1-123, etc., or smaller fragments containing epitopes of
the full-length protein may be used in the subject fusions, such as those epitopes
found between amino acids 10-53, amino acids 10-45, amino acids 67-88, amino
acids 120-130, or any of the core epitopes identified in, e.g., Houghton et al., U.S.
Patent No. 5,350,671; Chien et al., Proc. Natl. Acad. Sci. USA (1992) 89:10011-
10015; Chien et al., J. Gastroent. Hepatol. (1993) 8:S33-39; Chien et al.,
International Publication No. WO 93/00365; Chien, D.Y., International Publication
No. WO 94/01778; and U.S. Patent Nos. 6,280,927 and 6,150,087. Moreover, a
protein resulting from a frameshift in the core region of the polyprotein, such as
described in International Publication No. WO 99/63941, may be used. The fusions
may also contain polynucleotides encoding E1E2 polypeptides, as described further
below.

Preferably, the above-described fusion proteins, as well as the individual
components of these proteins, are produced recombinantly. A polynucleotide
encoding these proteins can be introduced into an expression vector which can be
expressed in a suitable expression system. A variety of bacterial, yeast, mammalian
and insect expression systems are available in the art and any such expression system
can be used. Optionally, a polynucleotide encoding these proteins can be translated
in a cell-free translation system. Such methods are well known in the art. The
proteins also can be constructed by solid phase protein synthesis.

If desired, the fusion proteins, or the individual components of these proteins, 
also can contain other non-HCV amino acid sequences, such as amino acid linkers or 

15 signal sequences, as well as ligands useful in protein purification, such as 
glutathione-S-transferase and staphylococcal protein A. 

E1E2 Polypeptides 

As explained above, the compositions of the present invention may also
include E1 and E2 polypeptides, complexes of these polypeptides or polynucleotides
encoding the same. The E1 and E2 polypeptides and complexes thereof can be
provided independent of the nonstructural fusion protein or can be incorporated into
the same fusion. Moreover, E1E2 complexes can be provided as proteins, or as
polynucleotides encoding the same.

In this regard, E1, E2 and p7 are known to contain human T-cell epitopes
(both CD4+ and CD8+) and including one or more of these epitopes serves to
increase vaccine efficacy as well as to increase protective levels against multiple
HCV genotypes. Moreover, multiple copies of specific, conserved T-cell epitopes
can also be used in E1E2 complexes, such as a composite of epitopes from different
genotypes.

As explained above, the E1 and E2 polypeptides that make up the E1E2
complexes can be associated either through non-covalent or covalent interactions.
Such complexes may be made up of immunogenic fragments of E1 and E2 which
comprise epitopes. For example, fragments of E1 polypeptides can comprise from
about 5 to nearly the full-length of the molecule, such as 6, 10, 25, 50, 75, 100, 125,
150, 175, 185 or more amino acids of an E1 polypeptide, or any integer between the
stated numbers. Similarly, fragments of E2 polypeptides can comprise 6, 10, 25, 50,
75, 100, 150, 200, 250, 300, or 350 amino acids of an E2 polypeptide, or any integer
between the stated numbers. The E1 and E2 polypeptides may be from the same or
different HCV strains. For example, epitopes derived from, e.g., the hypervariable
region of E2, such as a region spanning amino acids 384-410 or 390-410, can be
included in the E2 polypeptide. A particularly effective E2 epitope to incorporate
into the E2 sequence or E1E2 complexes is one which includes a consensus sequence
derived from this region, such as the consensus sequence Gly-Ser-Ala-Ala-Arg-Thr-
Thr-Ser-Gly-Phe-Val-Ser-Leu-Phe-^^ (SEQ ID NO:5),
which represents a consensus sequence for amino acids 390-410 of the HCV type 1
genome. Additional epitopes of E1 and E2 are known and described in, e.g., Chien
et al., International Publication No. WO 93/00365.

Moreover, the E1 and E2 polypeptides may lack all or a portion of the
membrane spanning domain. The membrane anchor sequence functions to associate
the polypeptide to the endoplasmic reticulum. Normally, such polypeptides are
capable of secretion into growth medium in which an organism expressing the
protein is cultured. However, as described in International Publication No. WO
98/50556, such polypeptides may also be recovered intracellularly. Secretion into
growth medium is readily determined using a number of detection techniques,
including, e.g., polyacrylamide gel electrophoresis and the like, and immunological
techniques such as immunoprecipitation assays as described in, e.g., International
Publication No. WO 96/04301, published February 15, 1996. With E1, generally
polypeptides terminating with about amino acid position 370 and higher (based on
the numbering of HCV1 E1) will be retained by the ER and hence not secreted into
growth media. With E2, polypeptides terminating with about amino acid position
731 and higher (also based on the numbering of the HCV1 E2 sequence) will be
retained by the ER and not secreted. (See, e.g., International Publication No. WO
96/04301, published February 15, 1996). It should be noted that these amino acid
positions are not absolute and may vary to some degree. Thus, the present invention
contemplates the use of E1 and E2 polypeptides which retain the transmembrane
binding domain, as well as polypeptides which lack all or a portion of the
transmembrane binding domain, including E1 polypeptides terminating at about
amino acids 369 and lower, and E2 polypeptides terminating at about amino acids
730 and lower. Furthermore,
the C-terminal truncation can extend beyond the transmembrane spanning domain
towards the N-terminus. Thus, for example, E1 truncations occurring at positions
lower than, e.g., 360 and E2 truncations occurring at positions lower than, e.g., 715,
are also encompassed by the present invention. All that is necessary is that the
truncated E1 and E2 polypeptides remain functional for their intended purpose.
However, particularly preferred truncated E1 constructs are those that do not extend
beyond about amino acid 300. Most preferred are those terminating at position 360.
Preferred truncated E2 constructs are those with C-terminal truncations that do not
extend beyond about amino acid position 715. Particularly preferred E2 truncations
are those molecules truncated after any of amino acids 715-730, such as 725. If
truncated molecules are used, it is preferable to use E1 and E2 molecules that are
both truncated.
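
By way of illustration only, the approximate retention cut-offs stated above (E1 polypeptides ending at about position 370 or higher, and E2 polypeptides ending at about position 731 or higher, being retained by the ER) can be sketched as a simple lookup. The function name `likely_secreted` and the dictionary are hypothetical names introduced here; because the text notes that the positions are not absolute, the result is a rough guide and not a prediction of actual behavior.

```python
# Approximate first C-terminal position at which the polypeptide is expected
# to be retained by the ER (HCV-1 numbering, as stated in the text).
# These values are "about" figures and may vary to some degree.
ER_RETENTION_START = {"E1": 370, "E2": 731}


def likely_secreted(protein: str, c_terminus: int) -> bool:
    """Return True if a polypeptide ending at c_terminus falls below the
    approximate ER-retention cut-off for the given protein ("E1" or "E2")."""
    if protein not in ER_RETENTION_START:
        raise ValueError(f"unknown protein {protein!r}; expected 'E1' or 'E2'")
    return c_terminus < ER_RETENTION_START[protein]


if __name__ == "__main__":
    for protein, end in [("E1", 360), ("E1", 383), ("E2", 715), ("E2", 746)]:
        fate = "secreted" if likely_secreted(protein, end) else "retained by the ER"
        print(f"{protein} terminating at {end}: expected to be {fate}")
```

Secretion of any particular construct would still be confirmed experimentally, for example by the gel electrophoresis or immunoprecipitation techniques mentioned above.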

E2 exists as multiple species (Spaete et al., Virol. (1992) 188:819-830; Selby
et al., J. Virol. (1996) 70:5177-5182; Grakoui et al., J. Virol. (1993) 67:1385-1395;
Tomei et al., J. Virol. (1993) 67:4017-4026) and clipping and proteolysis may occur
at the N- and C-termini of the E1 and E2 polypeptides. Thus, an E2 polypeptide for
use herein may comprise at least amino acids 405-661, e.g., 400, 401, 402... to 661,
such as 384-661, 384-715, 384-746, 384-749 or 384-809, or 384 to any C-terminus
between 661-809, of an HCV polyprotein, numbered relative to the full-length HCV-1
polyprotein. Similarly, preferable E1 polypeptides for use herein can comprise
amino acids 192-326, 192-330, 192-333, 192-360, 192-363, 192-383, or 192 to any
C-terminus between 326-383, of an HCV polyprotein.

The E1 and E2 polypeptides and complexes thereof may also be present as
asialoglycoproteins. Such asialoglycoproteins are produced by methods known in the
art, such as by using cells in which terminal glycosylation is blocked. When these
proteins are expressed in such cells and isolated by GNA lectin affinity
chromatography, the E1 and E2 proteins aggregate spontaneously. Detailed methods
for producing these E1E2 aggregates are described in, e.g., U.S. Patent No.
6,074,852. For example, E1E2 complexes are readily produced recombinantly, either
as fusion proteins or by e.g., co-transfecting host cells with constructs encoding for
the E1 and E2 polypeptides of interest. Co-transfection can be accomplished either
in trans or cis, i.e., by using separate vectors or by using a single vector which bears
both of the E1 and E2 genes. If done using a single vector, both genes can be driven
by a single set of control elements or, alternatively, the genes can be present on the
vector in individual expression cassettes, driven by individual control elements.
Following expression, the E1 and E2 proteins will spontaneously associate.
Alternatively, the complexes can be formed by mixing the individual proteins
together which have been produced separately, either in purified or semi-purified
form, or even by mixing culture media in which host cells expressing the proteins
have been cultured, if the proteins are secreted. Finally, the E1E2 complexes of the
present invention may be expressed as a fusion protein wherein the desired portion of
E1 is fused to the desired portion of E2.

Moreover, the E1E2 complexes may be present as a heterogeneous mixture of
molecules, due to clipping and proteolytic cleavage, as described above. Thus, a
composition including E1E2 complexes may include multiple species of E1E2, such
as E1E2 terminating at amino acid 746 (E1E2 746), E1E2 terminating at amino acid
809 (E1E2 809), or any of the other various E1 and E2 molecules described above,
such as E2 molecules with N-terminal truncations of from 1-20 amino acids, such as
E2 species beginning at amino acid 387, amino acid 402, amino acid 403, etc.
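
As a non-limiting sketch, the residue ranges recited above (E1 from position 192 to any C-terminus between 326 and 383, and E2 from position 384 to any C-terminus between 661 and 809, numbered relative to HCV-1) can be expressed as 1-based, inclusive slices of a polyprotein string. The helper names `extract_region`, `e1_fragment` and `e2_fragment` are hypothetical, and the demonstration uses a placeholder string, not an actual HCV sequence.

```python
# Residue ranges quoted in the text (HCV-1 numbering, 1-based, inclusive).
E1_START, E1_CTERM_MIN, E1_CTERM_MAX = 192, 326, 383
E2_START, E2_CTERM_MIN, E2_CTERM_MAX = 384, 661, 809


def extract_region(polyprotein: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) of a polyprotein string."""
    if not 1 <= start <= end <= len(polyprotein):
        raise ValueError(f"range {start}-{end} lies outside the sequence")
    return polyprotein[start - 1:end]


def e1_fragment(polyprotein: str, c_terminus: int) -> str:
    """E1 polypeptide from position 192 to a C-terminus within 326-383."""
    if not E1_CTERM_MIN <= c_terminus <= E1_CTERM_MAX:
        raise ValueError("E1 C-terminus must lie between 326 and 383")
    return extract_region(polyprotein, E1_START, c_terminus)


def e2_fragment(polyprotein: str, c_terminus: int) -> str:
    """E2 polypeptide from position 384 to a C-terminus within 661-809."""
    if not E2_CTERM_MIN <= c_terminus <= E2_CTERM_MAX:
        raise ValueError("E2 C-terminus must lie between 661 and 809")
    return extract_region(polyprotein, E2_START, c_terminus)


if __name__ == "__main__":
    placeholder = "X" * 900  # stand-in only; substitute a real polyprotein sequence
    print(len(e1_fragment(placeholder, 360)))  # residues 192-360
    print(len(e2_fragment(placeholder, 715)))  # residues 384-715
```

The sketch fixes the N-termini at 192 and 384 for simplicity; N-terminally clipped E2 species such as those beginning at amino acid 387 or 402 could be obtained by calling `extract_region` directly.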

Methods for producing E1E2 complexes from full-length, truncated E1 and
E2 proteins which are secreted into media, as well as intracellularly produced
truncated proteins, are known in the art. For example, such complexes may be
produced recombinantly, as described in U.S. Patent No. 6,121,020; Ralston et al., J.
Virol. (1993) 67:6753-6761; Grakoui et al., J. Virol. (1993) 67:1385-1395; and
Lanford et al., Virology (1993) 197:225-235.

Polynucleotides Encoding the Fusion Proteins and E1E2 Complexes 

Polynucleotides contain less than an entire HCV genome and can be RNA or
single- or double-stranded DNA. Preferably, the polynucleotides are isolated free of
other components, such as proteins and lipids. The polynucleotides encode the
fusion proteins, E1 and E2 polypeptides and complexes thereof, described above, and
thus comprise coding sequences thereof. Polynucleotides of the invention can also
comprise other non-HCV nucleotide sequences, such as sequences coding for linkers,
signal sequences, or ligands useful in protein purification such as
glutathione-S-transferase and staphylococcal protein A.

Polynucleotides encoding the various HCV polypeptides can be isolated from
a genomic library derived from nucleic acid sequences present in, for example, the
plasma, serum, or liver homogenate of an HCV infected individual or can be
synthesized in the laboratory, for example, using an automatic synthesizer. An
amplification method such as PCR can be used to amplify polynucleotides from
either HCV genomic DNA or cDNA encoding therefor.

Polynucleotides can comprise coding sequences for these polypeptides which
occur naturally or can include artificial sequences which do not occur in nature.
These polynucleotides can be ligated to form a coding sequence for the fusion
proteins and E1E2 complexes using standard molecular biology techniques. If
desired, polynucleotides can be cloned into an expression vector and transformed
into, for example, bacterial, yeast, insect, or mammalian cells so that the fusion
proteins of the invention can be expressed in and isolated from a cell culture.

The expression constructs of the present invention, including the desired
fusion, or individual expression constructs comprising the individual components of
these fusions, may be used for nucleic acid immunization, to stimulate an
immunological response, such as a cellular immune response, using standard gene
delivery protocols. Methods for gene delivery are known in the art. See, e.g., U.S.
Patent Nos. 5,399,346, 5,580,859, 5,589,466. Genes can be delivered either directly
to the vertebrate subject or, alternatively, delivered ex vivo, to cells derived from the
subject and the cells reimplanted in the subject. For example, the constructs can be
delivered as plasmid DNA, e.g., contained within a plasmid, such as pBR322, pUC,
or ColE1.

Additionally, the expression constructs can be packaged in liposomes prior to
delivery to the cells. Lipid encapsulation is generally accomplished using liposomes
which are able to stably bind or entrap and retain nucleic acid. The ratio of
condensed DNA to lipid preparation can vary but will generally be around 1:1 (mg
DNA:micromoles lipid), or more of lipid. For a review of the use of liposomes as
carriers for delivery of nucleic acids, see, Hug and Sleight, Biochim. Biophys. Acta
(1991) 1097:1-17; Straubinger et al., in Methods of Enzymology (1983), Vol. 101, pp.
512-527.
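
Purely as a worked example of the ratio stated above (around 1 mg DNA per micromole of lipid, or more of lipid), the lipid requirement for a given DNA quantity can be computed as follows. The function names and the molecular weight used in the demonstration are hypothetical values chosen for illustration; the text does not specify a particular lipid or its molecular weight.

```python
def lipid_needed_umol(dna_mg: float, umol_lipid_per_mg_dna: float = 1.0) -> float:
    """Micromoles of lipid for dna_mg of DNA at the given ratio.

    The default of 1.0 reflects the "around 1:1 (mg DNA:micromoles lipid)"
    figure in the text; larger values correspond to "more of lipid".
    """
    if dna_mg < 0 or umol_lipid_per_mg_dna <= 0:
        raise ValueError("quantities must be non-negative and the ratio positive")
    return dna_mg * umol_lipid_per_mg_dna


def lipid_mass_mg(lipid_umol: float, mol_weight_g_per_mol: float) -> float:
    """Convert micromoles of lipid to milligrams (umol x g/mol = ug; /1000 = mg)."""
    return lipid_umol * mol_weight_g_per_mol / 1000.0


if __name__ == "__main__":
    dna_mg = 0.5
    umol = lipid_needed_umol(dna_mg)
    # 700 g/mol is an assumed, illustrative molecular weight only.
    print(f"{dna_mg} mg DNA -> {umol} umol lipid "
          f"(about {lipid_mass_mg(umol, 700.0):.2f} mg at an assumed 700 g/mol)")
```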

Liposomal preparations for use with the present invention include cationic
(positively charged), anionic (negatively charged) and neutral preparations, with
cationic liposomes particularly preferred. Cationic liposomes are readily available.
For example, N[1-(2,3-dioleyloxy)propyl]-N,N,N-triethylammonium (DOTMA)
liposomes are available under the trademark Lipofectin, from GIBCO BRL, Grand
Island, NY. (See, also, Felgner et al., Proc. Natl. Acad. Sci. USA (1987)
84:7413-7416). Other commercially available lipids include transfectace (DDAB/DOPE) and
DOTAP/DOPE (Boehringer). Other cationic liposomes can be prepared from readily
available materials using techniques well known in the art. See, e.g., Szoka et al.,
Proc. Natl. Acad. Sci. USA (1978) 75:4194-4198; PCT Publication No. WO
90/11092 for a description of the synthesis of DOTAP (1,2-bis(oleoyloxy)-3-
(trimethylammonio)propane) liposomes. The various liposome-nucleic acid
complexes are prepared using methods known in the art. See, e.g., Straubinger et al.,
in METHODS OF IMMUNOLOGY (1983), Vol. 101, pp. 512-527; Szoka et al.,
Proc. Natl. Acad. Sci. USA (1978) 75:4194-4198; Papahadjopoulos et al., Biochim.
Biophys. Acta (1975) 394:483; Wilson et al., Cell (1979) 17:77; Deamer and
Bangham, Biochim. Biophys. Acta (1976) 443:629; Ostro et al., Biochem. Biophys.
Res. Commun. (1977) 76:836; Fraley et al., Proc. Natl. Acad. Sci. USA (1979)
76:3348; Enoch and Strittmatter, Proc. Natl. Acad. Sci. USA (1979) 76:145; Fraley
et al., J. Biol. Chem. (1980) 255:10431; Szoka and Papahadjopoulos, Proc. Natl.
Acad. Sci. USA (1978) 75:145; and Schaefer-Ridder et al., Science (1982) 215:166.

The DNA can also be delivered in cochleate lipid compositions similar to
those described by Papahadjopoulos et al., Biochim. Biophys. Acta (1975)
394:483-491. See, also, U.S. Patent Nos. 4,663,161 and 4,871,488.

A number of viral-based systems have been developed for gene transfer into
mammalian cells. For example, retroviruses provide a convenient platform for gene
delivery systems, such as murine sarcoma virus, mouse mammary tumor virus,
Moloney murine leukemia virus, and Rous sarcoma virus. A selected gene can be
inserted into a vector and packaged in retroviral particles using techniques known in
the art. The recombinant virus can then be isolated and delivered to cells of the
subject either in vivo or ex vivo. A number of retroviral systems have been described
(U.S. Patent No. 5,219,740; Miller and Rosman, BioTechniques (1989) 7:980-990;
Miller, A.D., Human Gene Therapy (1990) 1:5-14; Scarpa et al., Virology (1991)
180:849-852; Burns et al., Proc. Natl. Acad. Sci. USA (1993) 90:8033-8037; and
Boris-Lawrie and Temin, Cur. Opin. Genet. Develop. (1993) 3:102-109). Briefly,
retroviral gene delivery vehicles of the present invention may be readily constructed
from a wide variety of retroviruses, including for example, B, C, and D type
retroviruses as well as spumaviruses and lentiviruses such as FIV, HIV, HIV-1,
HIV-2 and SIV (see RNA Tumor Viruses, Second Edition, Cold Spring Harbor
Laboratory, 1985). Such retroviruses may be readily obtained from depositories or
collections such as the American Type Culture Collection ("ATCC"; 10801
University Blvd., Manassas, VA 20110-2209), or isolated from known sources using
commonly available techniques.

A number of adenovirus vectors have also been described, such as adenovirus
Type 2 and Type 5 vectors. Unlike retroviruses which integrate into the host
genome, adenoviruses persist extrachromosomally thus minimizing the risks
associated with insertional mutagenesis (Haj-Ahmad and Graham, J. Virol. (1986)
57:267-274; Bett et al., J. Virol. (1993) 67:5911-5921; Mittereder et al., Human Gene
Therapy (1994) 5:717-729; Seth et al., J. Virol. (1994) 68:933-940; Barr et al., Gene
Therapy (1994) 1:51-58; Berkner, K.L. BioTechniques (1988) 6:616-629; and Rich et
al., Human Gene Therapy (1993) 4:461-476).

Molecular conjugate vectors, such as the adenovirus chimeric vectors
described in Michael et al., J. Biol. Chem. (1993) 268:6866-6869 and Wagner et al.,
Proc. Natl. Acad. Sci. USA (1992) 89:6099-6103, can also be used for gene delivery.

Members of the Alphavirus genus, such as but not limited to vectors derived
from the Sindbis and Semliki Forest viruses, and VEE, will also find use as viral vectors
for delivering the gene of interest. For a description of Sindbis-virus derived vectors
useful for the practice of the instant methods, see, Dubensky et al., J. Virol. (1996)
70:508-519; and International Publication Nos. WO 95/07995 and WO 96/17072.

Other vectors can be used, including but not limited to simian virus 40 and
cytomegalovirus. Bacterial vectors, such as Salmonella spp., Yersinia enterocolitica,
Shigella spp., Vibrio cholerae, Mycobacterium strain BCG, and Listeria
monocytogenes can be used. Minichromosomes such as MC and MC1,
bacteriophages, cosmids (plasmids into which phage lambda cos sites have been
inserted) and replicons (genetic elements that are capable of replication under their
own control in a cell) can also be used.

The expression constructs may also be encapsulated, adsorbed to, or
associated with, particulate carriers. Such carriers present multiple copies of a
selected molecule to the immune system and promote trapping and retention of
molecules in local lymph nodes. The particles can be phagocytosed by macrophages
and can enhance antigen presentation through cytokine release. Examples of
particulate carriers include those derived from polymethyl methacrylate polymers, as
well as microparticles derived from poly(lactides) and poly(lactide-co-glycolides),
known as PLG. See, e.g., Jeffery et al., Pharm. Res. (1993) 10:362-368; and McGee
et al., J. Microencap. (1996).

One preferred method for adsorbing macromolecules onto prepared
microparticles is described in International Publication No. WO 00/050006. Briefly,
microparticles are rehydrated and dispersed to an essentially monomeric suspension
of microparticles using dialyzable anionic or cationic detergents. Useful detergents
include, but are not limited to, any of the various N-methylglucamides (known as
MEGAs), such as heptanoyl-N-methylglucamide (MEGA-7), octanoyl-N-
methylglucamide (MEGA-8), nonanoyl-N-methylglucamide (MEGA-9), and
decanoyl-N-methyl-glucamide (MEGA-10); cholic acid; sodium cholate;
deoxycholic acid; sodium deoxycholate; taurocholic acid; sodium taurocholate;
taurodeoxycholic acid; sodium taurodeoxycholate;
3-[(3-cholamidopropyl)dimethylammonio]-1-propane-sulfonate (CHAPS);
3-[(3-cholamidopropyl)dimethylammonio]-2-hydroxy-1-propane-sulfonate (CHAPSO);
N-dodecyl-N,N-dimethyl-3-ammonio-1-propane-sulfonate (ZWITTERGENT 3-12);
N,N-bis-(3-D-gluconeamidopropyl)-deoxycholamide (DEOXY-BIGCHAP);
N-octylglucoside; sucrose monolaurate; glycocholic acid/sodium glycocholate;
laurosarcosine (sodium salt); glycodeoxycholic acid/sodium glycodeoxycholate;
sodium dodecyl sulfate (SDS); 3-(trimethylsilyl)-1-propanesulfonic acid (DSS);
cetrimide (CTAB, the principal component of which is
hexadecyltrimethylammonium bromide); hexadecyltrimethylammonium bromide;
dodecyltrimethylammonium bromide; hexadecyltrimethyl-ammonium bromide;
tetradecyltrimethylammonium bromide; benzyl dimethyldodecylammonium
bromide; benzyl dimethyl-hexadecylammonium chloride; and benzyl
dimethyltetradecylammonium bromide. The above detergents are commercially available from
e.g., Sigma Chemical Co., St. Louis, MO. Various cationic lipids known in the art
can also be used as detergents. See Balasubramaniam et al., 1996, Gene Ther.,
3:163-72 and Gao, X., and L. Huang. 1995, Gene Ther., 2:710-722.

A wide variety of other methods can be used to deliver the expression
constructs to cells. Such methods include DEAE dextran-mediated transfection,
calcium phosphate precipitation, polylysine- or polyornithine-mediated transfection,
or precipitation using other insoluble inorganic salts, such as strontium phosphate,
aluminum silicates including bentonite and kaolin, chromic oxide, magnesium
silicate, talc, and the like. Other useful methods of transfection include
electroporation, sonoporation, protoplast fusion, liposomes, peptoid delivery, or
microinjection. See, e.g., Sambrook et al., supra, for a discussion of techniques for
transforming cells of interest; and Felgner, P.L., Advanced Drug Delivery Reviews
(1990) 5:163-187, for a review of delivery systems useful for gene transfer. Methods
of delivering DNA using electroporation are described in, e.g., U.S. Patent Nos.
6,132,419; 6,451,002; 6,418,341; 6,233,483; U.S. Patent Publication No.
2002/0146831; and International Publication No. WO 00/45823.

Moreover, the HCV polynucleotides can be adsorbed to, or entrapped within,
an ISCOM. Classic ISCOMs are formed by combination of cholesterol, saponin,
phospholipid, and immunogens, such as viral envelope proteins. Generally, the HCV
molecules (usually with a hydrophobic region) are solubilized in detergent and added
to the reaction mixture, whereby ISCOMs are formed with the HCV molecule
incorporated therein. ISCOM matrix compositions are formed identically, but
without viral proteins. Proteins with high positive charge may be electrostatically
bound in the ISCOM particles, rather than through hydrophobic forces. For a more
detailed general discussion of saponins and ISCOMs, and methods of formulating
ISCOMs, see Barr et al. (1998) Adv. Drug Delivery Reviews 32:247-271; U.S.
Patent Nos. 4,981,684, 5,178,860, 5,679,354 and 6,027,732; European Publ. Nos.
EPA 109,942; 180,564 and 231,039; and Coulter et al. (1998) Vaccine 16:1243.

Additionally, biolistic delivery systems employing particulate carriers such as
gold and tungsten, are especially useful for delivering the expression constructs of
the present invention. The particles are coated with the construct to be delivered and
accelerated to high velocity, generally under a reduced atmosphere, using a gun
powder discharge from a "gene gun." For a description of such techniques, and
apparatuses useful therefor, see, e.g., U.S. Patent Nos. 4,945,050; 5,036,006;
5,100,792; 5,179,022; 5,371,015; and 5,478,744.

Compositions Comprising Fusion Proteins or Polynucleotides 

The invention also provides compositions comprising the fusion proteins or
polynucleotides, as well as compositions including the individual components of
these fusion proteins or polynucleotides. Compositions of the invention preferably
comprise a pharmaceutically acceptable carrier. The carrier should not itself induce
the production of antibodies harmful to the host. Pharmaceutically acceptable
carriers are well known to those in the art. Such carriers include, but are not limited
to, large, slowly metabolized macromolecules, such as proteins, polysaccharides
such as latex functionalized sepharose, agarose, cellulose, cellulose beads and the
like, polylactic acids, polyglycolic acids, polymeric amino acids such as
polyglutamic acid, polylysine, and the like, amino acid copolymers, and inactive
virus particles.

Pharmaceutically acceptable salts can also be used in compositions of the
invention, for example, mineral salts such as hydrochlorides, hydrobromides,
phosphates, or sulfates, as well as salts of organic acids such as acetates,
propionates, malonates, or benzoates. Especially useful protein substrates are serum
albumins, keyhole limpet hemocyanin, immunoglobulin molecules, thyroglobulin,
ovalbumin, tetanus toxoid, and other proteins well known to those of skill in the art.
Compositions of the invention can also contain liquids or excipients, such as water,
saline, glycerol, dextrose, ethanol, or the like, singly or in combination, as well as
substances such as wetting agents, emulsifying agents, or pH buffering agents.
Liposomes can also be used as a carrier for a composition of the invention; such
liposomes are described above.

If desired, co-stimulatory molecules which improve immunogen presentation
to lymphocytes, such as B7-1 or B7-2, or cytokines such as GM-CSF, IL-2, and
IL-12, can be included in a composition of the invention. Optionally, adjuvants can also
be included in a composition. Adjuvants which can be used include, but are not
limited to: (1) aluminum salts (alum), such as aluminum hydroxide, aluminum
phosphate, aluminum sulfate, etc.; (2) oil-in-water emulsion formulations (with or
without other specific immunostimulating agents such as muramyl peptides (see
below) or bacterial cell wall components), such as for example (a) MF59 (U.S. Patent
No. 6,299,884; Chapter 10 in Vaccine design: the subunit and adjuvant approach,
eds. Powell & Newman, Plenum Press 1995), containing 5% Squalene, 0.5%
TWEEN 80™, and 0.5% SPAN 85™ (optionally containing various amounts of
MTP-PE (see below), although not required) formulated into submicron particles
using a microfluidizer such as Model 110Y microfluidizer (Microfluidics, Newton,
MA), (b) SAF, containing 10% Squalane, 0.4% TWEEN 80™, 5% pluronic-blocked
polymer L121, and thr-MDP either microfluidized into a submicron emulsion or
vortexed to generate a larger particle size emulsion, and (c) RIBI™ adjuvant system
(RAS), (Ribi Immunochem, Hamilton, MT) containing 2% Squalene, 0.2% TWEEN
80™, and one or more bacterial cell wall components from the group consisting of
monophosphorylipid A (MPL), trehalose dimycolate (TDM), and cell wall skeleton
(CWS), preferably MPL + CWS (DETOX™); (3) saponin adjuvants, such as QS21
or STIMULON™ (Cambridge Bioscience, Worcester, MA) may be used or particles
generated therefrom such as ISCOMs (immunostimulating complexes), which
ISCOMs may be devoid of additional detergent, see, e.g., International Publication
No. WO 00/07621; (4) Complete Freund's Adjuvant (CFA) and Incomplete Freund's
Adjuvant (IFA); (5) cytokines, such as interleukins (IL-1, IL-2, IL-4, IL-5, IL-6,
IL-7, IL-12 (International Publication No. WO 99/44636), etc.), interferons (e.g.,
gamma interferon), macrophage colony stimulating factor (M-CSF), tumor necrosis
factor (TNF), etc.; (6) detoxified mutants of a bacterial ADP-ribosylating toxin such
as a cholera toxin (CT), a pertussis toxin (PT), or an E. coli heat-labile toxin (LT),
particularly LT-K63 (where lysine is substituted for the wild-type amino acid at
position 63), LT-R72 (where arginine is substituted for the wild-type amino acid at
position 72), CT-S109 (where serine is substituted for the wild-type amino acid at
position 109), and PT-K9/G129 (where lysine is substituted for the wild-type amino
acid at position 9 and glycine substituted at position 129) (see, e.g., International
Publication Nos. WO 93/13202 and WO 92/19265); (7) MPL or 3-O-deacylated MPL
(3dMPL) (see, e.g., GB 2220221, EP-A-0689454), optionally in the substantial
absence of alum when used with pneumococcal saccharides (see, e.g., International
Publication No. WO 00/56358); (8) combinations of 3dMPL with, for example,
QS21 and/or oil-in-water emulsions (see, e.g., EP-A-0835318, EP-A-0735898,
EP-A-0761231); (9) oligonucleotides comprising CpG motifs (see, e.g., Roman et al.
(1997) Nat. Med. 3:849-854; Weiner et al. (1997) Proc. Natl. Acad. Sci. USA
94:10833-10837; Davis et al. (1998) J. Immunol. 160:870-876; Chu et al. (1997) J.
Exp. Med. 186:1623-1631; Lipford et al. (1997) Eur. J. Immunol. 27:2340-2344;
Moldoveanu et al. (1998) Vaccine 16:1216-1224; Krieg et al. (1995) Nature
374:546-549; Klinman et al. (1996) Proc. Natl. Acad. Sci. USA 93:2879-2883; Ballas
et al. (1996) J. Immunol. 157:1840-1845; Cowdery et al. (1996) J. Immunol.
156:4570-4575; Halpern et al. (1996) Cell Immunol. 167:72-78; Yamamoto et al.
(1988) Jpn. J. Cancer Res. 79:866-873; Stacey et al. (1996) J. Immunol.
157:2116-2122; Messina et al. (1991) J. Immunol. 147:1759-1764; Yi et al. (1996) J. Immunol.
157:4918-4925; Yi et al. (1996) J. Immunol. 157:5394-5402; Yi et al. (1998) J.
Immunol. 160:4755-4761; Yi et al. (1998) J. Immunol. 160:5898-5906; International
Publication Nos. WO 96/02555, WO 98/16247, WO 98/18810, WO 98/40100, WO
98/55495, WO 98/37919 and WO 98/52581), such as those containing at least one CG
dinucleotide, with cytosine optionally replaced with 5-methylcytosine; (10) a
polyoxyethylene ether or a polyoxyethylene ester (see, e.g., International Publication
No. WO 99/52549); (11) a polyoxyethylene sorbitan ester surfactant in combination
with an octoxynol (see, e.g., International Publication No. WO 01/21207) or a
polyoxyethylene alkyl ether or ester surfactant in combination with at least one
additional non-ionic surfactant such as an octoxynol (see, e.g., International
Publication No. WO 01/21152); (12) a saponin and an immunostimulatory
oligonucleotide such as a CpG oligonucleotide (see, e.g., International Publication
No. WO 00/62800); (13) an immunostimulant and a particle of metal salt (see, e.g.,
International Publication No. WO 00/23105); and (14) other substances that act as
immunostimulating agents to enhance the effectiveness of the composition.

Muramyl peptides include, but are not limited to, N-acetyl-muramyl-L-
threonyl-D-isoglutamine (thr-MDP), N-acetyl-normuramyl-L-alanyl-D-isoglutamine
(nor-MDP), N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(1'-2'-
dipalmitoyl-sn-glycero-3-hydroxyphosphoryloxy)-ethylamine (MTP-PE), etc.

Moreover, the HCV proteins can be adsorbed to, or entrapped within, an
ISCOM, as described above. Additionally, ISCOMs with adsorbed HCV core
proteins, either the entire core region or a fragment of HCV core protein, may be
added to the formulations. Most preferably, the HCV core protein is a fragment
comprising a polypeptide from the region spanning amino acid positions 121-135.
See, e.g., International Publication No. WO 01/37869A.

As explained above, the composition may also contain immunostimulatory
molecules, either in addition to or in place of the antigen delivery system.
Immunostimulatory agents for use herein include, without limitation,
monophosphorylipid A (MPL), trehalose dimycolate (TDM), and cell wall skeleton
(CWS), preferably MPL + CWS (Detox™). MPL may be formulated into an
emulsion to enhance its immunostimulatory effect. See, e.g., Ulrich et al., "MPL®
immunostimulant: adjuvant formulations," in Vaccine Adjuvants: Preparation Methods
and Research Protocols (O'Hagan DT, ed.) Humana Press Inc., NJ (2000) pp.
273-282. MPL has been shown to induce the synthesis and release of cytokines,
particularly IL-2 and IFN-γ. Other useful immunostimulatory molecules include
LPS and immunostimulatory nucleic acid sequences (ISS), including but not limited
to, unmethylated CpG motifs, such as CpG oligonucleotides.

Oligonucleotides containing unmethylated CpG motifs have been shown to
induce activation of B cells, NK cells and antigen-presenting cells (APCs), such as
monocytes and macrophages. See, e.g., U.S. Patent No. 6,207,646. Thus, adjuvants
derived from the CpG family of molecules, CpG dinucleotides and synthetic
oligonucleotides which comprise CpG motifs (see, e.g., Krieg et al. Nature (1995)
374:546 and Davis et al. J. Immunol. (1998) 160:870-876) such as any of the various
immunostimulatory CpG oligonucleotides disclosed in U.S. Patent No. 6,207,646,
may be used in the subject methods and compositions. Such CpG oligonucleotides
generally comprise at least 8 up to about 100 basepairs, preferably 8 to 40 basepairs,
more preferably 15-35 basepairs, preferably 15-25 basepairs, and any number of
basepairs between these values. For example, oligonucleotides comprising the
consensus CpG motif, represented by the formula 5'-X1CGX2-3', where X1 and X2 are
nucleotides and C is unmethylated, will find use as immunostimulatory CpG
molecules. Generally, X1 is A, G or T, and X2 is C or T. Other useful CpG
molecules include those captured by the formula 5'-X1X2CGX3X4, where X1 and X2
are a sequence such as GpT, GpG, GpA, ApA, ApT, ApG, CpT, CpA, CpG, TpA,
TpT or TpG, and X3 and X4 are TpT, CpT, ApT, ApG, CpG, TpC, ApC, CpC, TpA,
ApA, GpT, CpA, or TpG, wherein "p" signifies a phosphate bond. Preferably, the
oligonucleotides do not include a GCG sequence at or near the 5'- and/or 3'-terminus.
Additionally, the CpG is preferably flanked on its 5'-end with two purines (preferably
a GpA dinucleotide) or with a purine and a pyrimidine (preferably, GpT), and
flanked on its 3'-end with two pyrimidines, preferably a TpT or TpC dinucleotide.
Thus, preferred molecules will comprise the sequence GACGTT, GACGTC,
GTCGTT or GTCGCT, and these sequences will be flanked by several additional
nucleotides. The nucleotides outside of this central core area appear to be extremely
amenable to change.

Moreover, the CpG oligonucleotides for use herein may be double- or
single-stranded. Double-stranded molecules are more stable in vivo while single-stranded
molecules display enhanced immune activity. Additionally, the phosphate backbone
may be modified, such as phosphorodithioate-modified, in order to enhance the
immunostimulatory activity of the CpG molecule. As described in U.S. Patent No.
6,207,646, CpG molecules with phosphorothioate backbones preferentially activate
B-cells, while those having phosphodiester backbones preferentially activate
monocytic (macrophages, dendritic cells and monocytes) and NK cells.

One exemplary CpG oligonucleotide for use in the present compositions has
the sequence 5'-TCCATGACGTTCCTGACGTT-3' (SEQ ID NO:6).
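
The sequence rules described above for CpG oligonucleotides can be checked mechanically. The following is a minimal Python sketch, not part of the original disclosure: the helper name describe_cpg_oligo is hypothetical, and "at or near" a terminus is interpreted, as an assumption, as within the first or last four bases. Methylation state cannot be read from a plain sequence string and is not checked.

```python
import re

# Consensus motif 5'-X1 CG X2-3' with X1 in {A, G, T} and X2 in {C, T}.
# A lookahead is used so that overlapping motifs are all reported.
CONSENSUS = re.compile(r"(?=([AGT]CG[CT]))")

# Preferred six-base cores named in the text.
PREFERRED_CORES = ("GACGTT", "GACGTC", "GTCGTT", "GTCGCT")


def describe_cpg_oligo(seq: str) -> dict:
    """Summarize CpG features of a DNA oligonucleotide (hypothetical helper)."""
    seq = seq.upper().replace(" ", "")
    if set(seq) - set("ACGT"):
        raise ValueError("sequence must contain only A, C, G and T")
    return {
        "length": len(seq),
        # The text gives a general range of 8 to about 100 bases.
        "length_ok": 8 <= len(seq) <= 100,
        # 0-based start positions of each X1-C-G-X2 consensus match.
        "consensus_hits": [m.start(1) for m in CONSENSUS.finditer(seq)],
        "preferred_cores": [core for core in PREFERRED_CORES if core in seq],
        # Assumption: "near the terminus" means the first or last four bases.
        "terminal_gcg": "GCG" in seq[:4] or "GCG" in seq[-4:],
    }


SEQ_ID_NO_6 = "TCCATGACGTTCCTGACGTT"
summary = describe_cpg_oligo(SEQ_ID_NO_6)
print(summary)
```

Applied to SEQ ID NO:6, the sketch reports two consensus matches, both within the preferred GACGTT core, and no terminal GCG.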

CpG molecules can readily be tested for their ability to stimulate an immune
response using standard techniques, well known in the art. For example, the ability
of the molecule to stimulate a humoral and/or cellular immune response is readily
determined using the immunoassays described above. Moreover, the antigen and
adjuvant compositions can be administered with and without the CpG molecule to
determine whether an immune response is enhanced.

The HCV proteins may also be encapsulated, adsorbed to, or associated with,
particulate carriers, as described above with reference to the HCV polynucleotides.
As explained above, examples of particulate carriers include those derived from
polymethyl methacrylate polymers, as well as microparticles derived from
poly(lactides) and poly(lactide-co-glycolides), known as PLG. See, e.g., Jeffery et
al., Pharm. Res. (1993) 10:362-368; and McGee et al., J. Microencap. (1996). One
preferred method for adsorbing macromolecules onto prepared microparticles is
described above and in International Publication No. WO 00/050006.

Methods of Producing HCV-Specific Antibodies 

The HCV proteins can be used to produce HCV-specific polyclonal and
monoclonal antibodies. HCV-specific polyclonal and monoclonal antibodies
specifically bind to HCV antigens. Polyclonal antibodies can be produced by
administering the fusion protein to a mammal, such as a mouse, a rabbit, a goat, or a
horse. Serum from the immunized animal is collected and the antibodies are purified
from the plasma by, for example, precipitation with ammonium sulfate, followed by
chromatography, preferably affinity chromatography. Techniques for producing and
processing polyclonal antisera are known in the art.

Monoclonal antibodies directed against HCV-specific epitopes present in the
proteins can also be readily produced. Normal B cells from a mammal, such as a
mouse, immunized with an HCV protein, can be fused with, for example,
HAT-sensitive mouse myeloma cells to produce hybridomas. Hybridomas producing
HCV-specific antibodies can be identified using RIA or ELISA and isolated by
cloning in semi-solid agar or by limiting dilution. Clones producing HCV-specific
antibodies are isolated by another round of screening.

Antibodies, either monoclonal or polyclonal, which are directed against
HCV epitopes, are particularly useful for detecting the presence of HCV or HCV
antigens in a sample, such as a serum sample from an HCV-infected human. An
immunoassay for an HCV antigen may utilize one antibody or several antibodies. An
immunoassay for an HCV antigen may use, for example, a monoclonal antibody
directed towards an HCV epitope, a combination of monoclonal antibodies directed
towards epitopes of one HCV polypeptide, monoclonal antibodies directed towards
epitopes of different HCV polypeptides, polyclonal antibodies directed towards the
same HCV antigen, polyclonal antibodies directed towards different HCV antigens,
or a combination of monoclonal and polyclonal antibodies. Immunoassay protocols
may be based, for example, upon competition, direct reaction, or sandwich type
assays using, for example, labeled antibody. The labels may be, for example,
fluorescent, chemiluminescent, or radioactive.

The polyclonal or monoclonal antibodies may further be used to isolate HCV
particles or antigens by immunoaffinity columns. The antibodies can be affixed to a
solid support by, for example, adsorption or by covalent linkage so that the
antibodies retain their immunoselective activity. Optionally, spacer groups may be
included so that the antigen binding site of the antibody remains accessible. The
immobilized antibodies can then be used to bind HCV particles or antigens from a
biological sample, such as blood or plasma. The bound HCV particles or antigens
are recovered from the column matrix by, for example, a change in pH.

HCV-Specific T cells 

HCV-specific T cells that are activated by the above-described fusions and
E1E2 complexes, including the NS3NS4NS5a fusion protein or NS3NS4NS5aNS5b
fusion protein, and the E1E2 complexes, expressed in vivo or in vitro, preferably
recognize an epitope of an HCV polypeptide such as an E1, E2, NS3, NS4, NS5a, or
NS5b polypeptide, including an epitope of an NS3NS4NS5a fusion protein or an
NS3NS4NS5aNS5b fusion protein, or an E1E2 complex. HCV-specific T cells can
be CD8+ or CD4+.

HCV-specific CD8+ T cells preferably are cytotoxic T lymphocytes (CTL)
which can kill HCV-infected cells that display E1, E2, NS3, NS4, NS5a, or NS5b
epitopes complexed with an MHC class I molecule. HCV-specific CD8+ T cells may
also express interferon-γ (IFN-γ). HCV-specific CD8+ T cells can be detected by, for
example, 51Cr release assays (see the examples). 51Cr release assays measure the
ability of HCV-specific CD8+ T cells to lyse target cells displaying an E1, E2, E1E2,
NS3, NS4, NS5a, NS5b, NS3NS4NS5a, or NS3NS4NS5aNS5b epitope.
HCV-specific CD8+ T cells which express IFN-γ can also be detected by immunological
methods, preferably by intracellular staining for IFN-γ after in vitro stimulation with
an E1, E2, NS3, an NS4, an NS5a, or an NS5b polypeptide (see the examples).

HCV-specific CD4+ cells activated by the above-described E1E2 complexes
and fusions, such as an E1 polypeptide, an E2 polypeptide, an E1E2 complex,
NS3NS4NS5a or NS3NS4NS5aNS5b fusion protein, expressed in vivo or in vitro,
preferably recognize an epitope of an E1, E2, NS3, NS4, NS5a, or NS5b polypeptide,
including an epitope of an E1E2 complex, NS3NS4NS5a or NS3NS4NS5aNS5b
fusion protein, that is bound to an MHC class II molecule on an HCV-infected cell
and proliferate in response to stimulating E1E2 complexes with NS3NS4NS5a or
NS3NS4NS5aNS5b peptides, with or without a core polypeptide.

HCV-specific CD4+ T cells can be detected by a lymphoproliferation assay
(see examples). Lymphoproliferation assays measure the ability of HCV-specific
CD4+ T cells to proliferate in response to an E1, E2, NS3, an NS4, an NS5a, or an
NS5b epitope.

Methods of Activating HCV-Specific T Cells.

The HCV proteins or polynucleotides can be used to stimulate an immune
response, such as to activate HCV-specific T cells either in vitro or in vivo.
Activation of HCV-specific T cells can be used, inter alia, to provide model systems
to optimize CTL responses to HCV and to provide prophylactic or therapeutic
treatment against HCV infection. For in vitro activation, proteins are preferably
supplied to T cells via a plasmid or a viral vector, such as an adenovirus vector, as
described above.

Polyclonal populations of T cells can be derived from the blood, and
preferably from peripheral lymphoid organs, such as lymph nodes, spleen, or thymus, of
mammals that have been infected with an HCV. Preferred mammals include mice,
chimpanzees, baboons, and humans. The HCV serves to expand the number of
activated HCV-specific T cells in the mammal. The HCV-specific T cells derived
from the mammal can then be restimulated in vitro by adding, e.g., HCV E1E2 and
NS3NS4NS5a or NS3NS4NS5aNS5b epitopic peptides, with or without a core
polypeptide, to the T cells. The HCV-specific T cells can then be tested for, inter
alia, proliferation, the production of IFN-γ, and the ability to lyse target cells
displaying E1E2, NS3NS4NS5a or NS3NS4NS5aNS5b epitopes in vitro.

In a lymphoproliferation assay (see examples), HCV-activated CD4+ T cells
proliferate when cultured with an NS3, NS4, NS5a, NS5b, NS3NS4NS5a, or
NS3NS4NS5aNS5b epitopic peptide, but not in the absence of an epitopic peptide.
Thus, particular E1, E2, NS3, NS4, NS5a, NS5b, NS3NS4NS5a and
NS3NS4NS5aNS5b epitopes that are recognized by HCV-specific CD4+ T cells can
be identified using a lymphoproliferation assay.

Similarly, detection of IFN-γ in HCV-specific CD8+ T cells after in vitro
stimulation with the above-described HCV proteins can be used to identify E1, E2,
E1E2, NS3, NS4, NS5a, NS5b, NS3NS4NS5a, and NS3NS4NS5aNS5b epitopes that
are particularly effective at stimulating CD8+ T cells to produce IFN-γ (see examples).

Further, 51Cr release assays are useful for determining the level of CTL
response to HCV. See Cooper et al. Immunity 10:439-449. For example,
HCV-specific CD8+ T cells can be derived from the liver of an HCV-infected mammal.
These T cells can be tested in 51Cr release assays against target cells displaying, e.g.,
E1E2, NS3NS4NS5a and/or NS3NS4NS5aNS5b epitopes. Several target cell
populations expressing different E1E2, NS3NS4NS5a and/or NS3NS4NS5aNS5b
epitopes can be constructed so that each target cell population displays different
epitopes of E1E2, NS3NS4NS5a and/or NS3NS4NS5aNS5b. The HCV-specific
CD8+ cells can be assayed against each of these target cell populations. The results
of the 51Cr release assays can be used to determine which epitopes of E1E2,
NS3NS4NS5a and/or NS3NS4NS5aNS5b are responsible for the strongest CTL
response to HCV. E1E2 complexes, NS3NS4NS5a fusion proteins or
NS3NS4NS5aNS5b fusion proteins, with or without core polypeptides, which
contain the epitopes responsible for the strongest CTL response can then be
constructed using the information derived from the 51Cr release assays.

HCV proteins as described above, or polynucleotides encoding such proteins,
can be administered to a mammal, such as a mouse, baboon, chimpanzee, or human,
to stimulate an immune response, such as to activate HCV-specific T cells in vivo.
Administration can be by any means known in the art, including parenteral,
intranasal, intramuscular or subcutaneous injection, including injection using a
biological ballistic gun ("gene gun"), as discussed above.

Preferably, injection of an HCV polynucleotide is used to activate T cells. In
addition to the practical advantages of simplicity of construction and modification,
injection of the polynucleotides results in the synthesis of a fusion protein in the host.
Thus, these immunogens are presented to the host immune system with native
post-translational modifications, structure, and conformation. The polynucleotides are
preferably injected intramuscularly to a large mammal, such as a human, at a dose of
0.5, 0.75, 1.0, 1.5, 2.0, 2.5, 5 or 10 mg/kg.

A composition of the invention comprising the HCV proteins or
polynucleotides is administered in a manner compatible with the particular
composition used and in an amount which is effective to stimulate an immune
response, such as to activate HCV-specific T cells as measured by, inter alia, a 51Cr
release assay, a lymphoproliferation assay, or by intracellular staining for IFN-γ. The
proteins and/or polynucleotides can be administered either to a mammal which is not
infected with an HCV or can be administered to an HCV-infected mammal. The
particular dosages of the polynucleotides or proteins in a composition will depend on
many factors including, but not limited to, the species, age, and general condition of
the mammal to which the composition is administered, and the mode of
administration of the composition. An effective amount of the composition of the
invention can be readily determined using only routine experimentation. In vitro and
in vivo models described above can be employed to identify appropriate doses. The
amount of polynucleotide used in the example described below provides general
guidance which can be used to optimize the activation of HCV-specific T cells either
in vivo or in vitro. Generally, 0.5, 0.75, 1.0, 1.5, 2.0, 2.5, 5 or 10 mg of an HCV
fusion and E1 and E2 polypeptides, such as an E1E2 complex, an NS3NS4NS5a or
NS3NS4NS5aNS5b fusion protein or polynucleotide, with or without a core
polypeptide, will be administered to a large mammal, such as a baboon, chimpanzee,
or human. If desired, co-stimulatory molecules or adjuvants can also be provided
before, after, or together with the compositions.

Immune responses of the mammal generated by the delivery of a composition
of the invention, including activation of HCV-specific T cells, can be enhanced by
varying the dosage, route of administration, or boosting regimens. Compositions of
the invention may be given in a single dose schedule, or preferably in a multiple dose
schedule in which a primary course of vaccination includes 1-10 separate doses,
followed by other doses given at subsequent time intervals required to maintain
and/or reinforce an immune response, for example, at 1-4 months for a second dose,
and if needed, a subsequent dose or doses after several months.

Deposits of Strains Useful in Practicing the Invention 

A deposit of biologically pure cultures of the following strains was made with
the American Type Culture Collection, 10801 University Boulevard, Manassas, VA.
The accession number indicated was assigned after successful viability testing, and
the requisite fees were paid. The deposit was made under the provisions of the
Budapest Treaty on the International Recognition of the Deposit of Microorganisms
for the Purpose of Patent Procedure and the Regulations thereunder (Budapest
Treaty). This assures maintenance of viable cultures for a period of thirty (30) years
from the date of deposit. The organisms will be made available by the ATCC under
the terms of the Budapest Treaty, which assures permanent and unrestricted
availability of the progeny to one determined by the U.S. Commissioner of Patents
and Trademarks to be entitled thereto according to 35 U.S.C. §122 and the
Commissioner's rules pursuant thereto (including 37 C.F.R. §1.12 with particular
reference to 886 OG 638). Upon the granting of a patent, all restrictions on the
availability to the public of the deposited cultures will be irrevocably removed.

These deposits are provided merely as a convenience to those of skill in the art,
and are not an admission that a deposit is required under 35 U.S.C. §112. The
nucleic acid sequences of these genes, as well as the amino acid sequences of the
molecules encoded thereby, are incorporated herein by reference and are controlling
in the event of any conflict with the description herein. A license may be required to
make, use, or sell the deposited materials, and no such license is hereby granted.

Plasmid       Deposit Date       ATCC No.
E1E2-809      August 16, 2001    PTA-3643

III. Experimental

Below are examples of specific embodiments for carrying out the present 
invention. The examples are offered for illustrative purposes only, and are not 
intended to limit the scope of the present invention in any way. Those of skill in the 
art will readily appreciate that the invention may be practiced in a variety of ways 
given the teaching of this disclosure. 

Efforts have been made to ensure accuracy with respect to numbers used (e.g., 
amounts, temperatures, etc.), but some experimental error and deviation should, of 
course, be allowed for. 

EXAMPLE 1 

Production of NS3NS4NS5a Polynucleotides. 

A polynucleotide encoding NS3NS4NS5a (approximately amino acids 1027
to 2399, numbered relative to HCV-1) (also termed "NS345a" herein) or NS5a
(approximately amino acids 1973 to 2399, numbered relative to HCV-1) was isolated
from an HCV. Polynucleotides encoding a methionine residue were ligated to the 5'
end of these polynucleotides and the polynucleotides were cloned into plasmid,
vaccinia virus, and adenovirus vectors.

Immunization Protocols. In one immunization protocol, mice were
immunized with 50 μg of plasmid DNA encoding either NS5a or encoding an
NS3NS4NS5a fusion protein by intramuscular injection into the tibialis anterior. A
booster injection of 10^7 pfu of vaccinia virus (VV)-NS5a (intraperitoneal) or 50 μg of
plasmid control (intramuscular) was provided 6 weeks later.

In another immunization protocol, mice were injected intramuscularly in the
tibialis anterior with 10^10 adenovirus particles encoding an NS3NS4NS5a fusion
protein. An intraperitoneal booster injection of 10^7 pfu of VV-NS5a or an
intramuscular booster injection of 10^10 adenovirus particles encoding NS3NS4NS5a
was provided 6 weeks later.

EXAMPLE 2 

Immunization with DNA encoding an NS3NS4NS5a fusion protein activates
HCV-specific CD8+ T cells.

51Cr Release Assay. A 51Cr release assay was used to measure the ability of
HCV-specific T cells to lyse target cells displaying an NS5a epitope. Spleen cells
were pooled from the immunized animals. These cells were restimulated in vitro for
6 days with the CTL epitopic peptide p214K9 (2152-HEYPVGSQL-2160; SEQ ID
NO:1) from HCV-NS5a in the presence of IL-2. The spleen cells were then assayed
for cytotoxic activity in a standard 51Cr release assay against peptide-sensitized target
cells (L929) expressing class I, but not class II MHC molecules, as described in
Weiss (1980) J. Biol. Chem. 255:9912-9917. Ratios of effector (T cells) to target (B
cells) of 60:1, 20:1, and 7:1 were tested. Percent specific lysis was calculated for
each effector to target ratio.
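
The text does not spell out how percent specific lysis was computed. The sketch below uses the conventional 51Cr release formula, (experimental - spontaneous) / (maximum - spontaneous) x 100, which is an assumption about the method rather than something stated in this example. The function names and the counts used in the demonstration are hypothetical.

```python
def percent_specific_lysis(experimental_cpm: float,
                           spontaneous_cpm: float,
                           maximum_cpm: float) -> float:
    """Conventional 51Cr release calculation (assumed, not stated in the text).

    experimental_cpm: counts released from targets incubated with effectors
    spontaneous_cpm:  counts released from targets incubated in medium alone
    maximum_cpm:      counts released from fully lysed targets
    """
    if maximum_cpm <= spontaneous_cpm:
        raise ValueError("maximum release must exceed spontaneous release")
    return 100.0 * (experimental_cpm - spontaneous_cpm) / (maximum_cpm - spontaneous_cpm)


def effectors_per_well(targets_per_well: int, ratios=(60, 20, 7)) -> dict:
    """Effector cells needed per well for each effector-to-target ratio in the text."""
    return {f"{ratio}:1": ratio * targets_per_well for ratio in ratios}


# Hypothetical counts, for illustration only.
example_lysis = percent_specific_lysis(1500, 500, 2500)
example_cells = effectors_per_well(5000)
print(example_lysis, example_cells)
```

The same calculation would be repeated at each of the 60:1, 20:1 and 7:1 ratios to produce a titration curve.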

The results of the assays are shown in Tables 1 and 2. Table 1 demonstrates
that immunization with plasmid DNA encoding an NS3NS4NS5a fusion protein
activates CD8+ T cells which recognize and lyse target cells displaying an NS5a
epitope. Surprisingly, the NS5a polypeptide of the NS3NS4NS5a fusion protein was
able to activate T cells even though the NS5a polypeptide was present in a fusion
protein.

Similarly, Table 2 demonstrates that delivery of the NS3NS4NS5a fusion
protein to mice by means of an adenovirus vector also activates CD8+ T cells which
recognize and lyse target cells displaying an HCV NS5a epitope. Thus,
immunization with either "naked" (plasmid) DNA encoding an NS3NS4NS5a fusion
protein or adenovirus vector-encoded fusion protein can be used to activate
HCV-specific T cells.

EXAMPLE 3 

Immunization with DNA encoding an NS3NS4NS5a fusion protein activates
HCV-specific CD8+ T cells which express IFN-γ.

Intracellular Staining for Interferon-gamma (IFN-γ). Intracellular staining
for IFN-γ was used to identify the CD8+ T cells that secrete IFN-γ after in vitro
stimulation with the NS5a epitope p214K9. Spleen cells of individual immunized
mice were restimulated in vitro either with p214K9 or with a non-specific peptide for
6-12 hours in the presence of IL-2 and monensin. The cells were then stained for
surface CD8 and for intracellular IFN-γ and analyzed by flow cytometry. The
percent of CD8+ T cells which were also positive for IFN-γ was then calculated.
The results of these assays are shown in Tables 1 and 2. Table 1 demonstrates that
CD8+ T cells activated in response to immunization with plasmid DNA encoding an
NS3NS4NS5a fusion protein also express IFN-γ. Immunization with an
NS3NS4NS5a fusion protein encoded in an adenovirus also results in CD8+
HCV-specific T cells which express IFN-γ, although to a lesser extent than immunization
with a plasmid-encoded NS3NS4NS5a fusion protein (Table 2).

EXAMPLE 4

Immunization with DNA encoding an NS3NS4NS5a fusion protein
stimulates proliferation of HCV-specific CD4+ T cells.

Lymphoproliferation assay. Spleen cells from pooled immunized mice were
depleted of CD8+ T cells using magnetic beads and were cultured in triplicate with
either p222D, an NS5a-epitopic peptide from HCV-NS5a
(2224-AELIEANLLWRQEMG-2238; SEQ ID NO:2), or in medium alone. After 72 hours,
cells were pulsed with 1 μCi per well of 3H-thymidine and harvested 6-8 hours later.
Incorporation of radioactivity was measured after harvesting. The mean cpm was
calculated.
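
The fold values printed in Tables 3 and 4 are consistent with dividing the mean cpm of peptide-stimulated wells by the mean cpm of medium-only wells (often called a stimulation index). The following minimal Python sketch reproduces them; the function name is hypothetical, and because the per-well triplicate counts are not reported, the published means are passed in as single-element lists.

```python
from statistics import mean


def stimulation_index(peptide_cpm, medium_cpm) -> float:
    """Ratio of mean cpm with peptide to mean cpm in medium alone."""
    return mean(peptide_cpm) / mean(medium_cpm)


# Published mean cpm values (p222D, media) from Tables 3 and 4.
REPORTED_MEANS = {
    "Table 3, NS5a DNA": (4523, 740),
    "Table 3, NS345a DNA": (4562, 861),
    "Table 4, NS345a Adeno": (896, 357),
    "Table 4, NS345a DNA": (1510, 385),
}

indices = {
    group: round(stimulation_index([peptide], [medium]), 1)
    for group, (peptide, medium) in REPORTED_MEANS.items()
}
print(indices)
```

With raw data, the triplicate counts for each condition would be passed directly, for example stimulation_index([c1, c2, c3], [m1, m2, m3]).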

As shown in Table 3, immunization with a plasmid-encoded NS3NS4NS5a
fusion protein stimulates proliferation of CD4+ HCV-specific T cells. Immunization
with an adenovirus vector encoding the fusion protein also resulted in stimulated
proliferation of CD4+ HCV-specific T cells (Table 4).

Table 3. HCV-NS5a-Specific CD4+ T Cells in Mice
Immunized with NS5a or NS345a DNA

                     Mean CPM
      NS5a DNA                  NS345a DNA
  p222D      media          p222D      media
  4523       740            4562       861
      (x6.1)                    (x5.3)

p222D is a CD4+ epitopic peptide (aa: 2224-AELIEANLLWRQEMG-2238, SEQ ID
NO:2) from HCV-NS5a






Table 4. HCV-NS5a-Specific CD4+ T Cells Primed by
Adenovirus or DNA Encoding for NS345a

                     Mean CPM
      NS345a Adeno              NS345a DNA
  p222D      media          p222D      media
  896        357            1510       385
      (x2.5)                    (x3.9)

p222D is a CD4+ epitopic peptide (aa: 2224-AELIEANLLWRQEMG-2238, SEQ ID
NO:2) from HCV-NS5a

EXAMPLE 5

Efficiency of NS345a-encoding DNA Vaccine Formulations to prime CTLs
in mice.

Mice were immunized with either 10-100 μg of plasmid DNA encoding
NS345a fusion protein as described in Example 1, with PLG-linked DNA encoding
NS345a, described below, or with DNA encoding NS345a, delivered via
electroporation (see, e.g., U.S. Patent Nos. 6,132,419; 6,451,002; 6,418,341;
6,233,483; U.S. Patent Publication No. 2002/0146831; and International Publication
No. WO 00/45823, for this delivery technique). The immunizations were followed by
a booster injection 6 weeks later of 1 x 10^7 pfu vaccinia virus encoding NS5a,
plasmid DNA encoding NS345a or plasmid DNA encoding NS5a, each as described
in Example 1.

PLG-delivered DNA. The polylactide-co-glycolide (PLG) polymers were
obtained from Boehringer Ingelheim, U.S.A. The PLG polymer used in this study
was RG505, which has a copolymer ratio of 50/50 and a molecular weight of 65 kDa
(manufacturer's data). Cationic microparticles with adsorbed DNA were prepared
using a modified solvent evaporation process, essentially as described in Singh et al.,
Proc. Natl. Acad. Sci. USA (2000) 97:811-816. Briefly, the microparticles were
prepared by emulsifying 10 ml of a 5% w/v polymer solution in methylene chloride
with 1 ml of PBS at high speed using an IKA homogenizer. The primary emulsion
was then added to 50 ml of distilled water containing cetyl trimethyl ammonium
bromide (CTAB) (0.5% w/v). This resulted in the formation of a w/o/w emulsion
which was stirred at 6000 rpm for 12 hours at room temperature, allowing the
methylene chloride to evaporate. The resulting microparticles were washed twice in
distilled water by centrifugation at 10,000 g and freeze dried. Following preparation,
washing and collection, DNA was adsorbed onto the microparticles by incubating
100 mg of cationic microparticles in a 1 mg/ml solution of DNA at 4°C for 6 hours.
The microparticles were then separated by centrifugation, the pellet washed with TE
buffer and the microparticles were freeze dried.

CTL activity and IFN-γ expression were measured by 51Cr release assay or
intracellular staining as described in Examples 2 and 3, respectively. The results are
shown in Table 5.

The results demonstrate that priming mice with plasmid DNA encoding
NS345a results in activation of CD8+ HCV-specific T cells.

Table 5: Efficiency of NS345a-Encoding DNA Vaccine Formulations to Prime CTLs in Mice

ICS for IFN-gamma (% CD8+ cells that are IFN-g+)

NS345a DNA Vaccines             Boost       Mean  StdevP  # of mice  % respond-  # of   Fold increase    CTL
                                                          tested     ing         expts  vs. 'naked' DNA  activity?
NS345a DNA                      VV NS5a     1.02  1.70    41         68%         10     N/A              YES
NS345a DNA                      NS345a DNA  0.02  0.04    22         5%          5      N/A              YES
NS345a DNA                      NS5a DNA    0.22  0.21    24         63%         5      N/A              YES
NS345a DNA eV (electroporation) VV NS5a     5.00  4.36    7          100%        2      4.90             YES
PLG NS345a DNA                  VV NS5a     2.65  2.54    6          100%        2      2.60             YES
PLG NS345a DNA                  NS5a DNA    0.33  0.24    15         80%         3      1.50             YES
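
The "fold increase vs. 'naked' DNA" column of Table 5 is not defined in the text. The reported values are consistent with dividing each group's mean by the mean of the plain NS345a DNA group that received the same boost; that reading is an inference, and the sketch below simply checks it against the published numbers. The dictionary keys and function name are hypothetical labels.

```python
# Mean % of CD8+ cells that are IFN-g+, keyed by (vaccine, boost), from Table 5.
TABLE_5_MEANS = {
    ("NS345a DNA", "VV NS5a"): 1.02,
    ("NS345a DNA", "NS345a DNA"): 0.02,
    ("NS345a DNA", "NS5a DNA"): 0.22,
    ("NS345a DNA eV", "VV NS5a"): 5.00,
    ("PLG NS345a DNA", "VV NS5a"): 2.65,
    ("PLG NS345a DNA", "NS5a DNA"): 0.33,
}
NAKED = "NS345a DNA"  # reference formulation (assumed to be the comparator)


def fold_vs_naked(vaccine: str, boost: str) -> float:
    """Mean response relative to naked NS345a DNA given the same boost."""
    reference = TABLE_5_MEANS[(NAKED, boost)]
    return TABLE_5_MEANS[(vaccine, boost)] / reference


fold_values = {
    key: round(fold_vs_naked(*key), 2)
    for key in TABLE_5_MEANS
    if key[0] != NAKED
}
print(fold_values)
```

Under this reading, electroporation and PLG delivery each raise the mean response relative to naked DNA when followed by the same boost.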



EXAMPLE 6 

Immunization routes and replicon particles SINCR (DC+) encoding for NS345a

Alphavirus replicon particles, for example, SINCR (DC+) were prepared as
described in Polo et al., Proc. Natl. Acad. Sci. USA (1999) 96:4598-4603. Mice were
injected with 5 x 10^6 IU SINCR (DC+) replicon particles encoding for NS345a
intramuscularly (IM) as described in Example 1, or subcutaneously (S/C) at the base of the
tail (BoT) and foot pad (FP), or with a combination of 2/3 of the DNA delivered via IM
administration and 1/3 via a BoT route. The immunizations were followed by a booster
injection of vaccinia virus encoding NS5a as described in Example 1.

IFN-γ expression was measured by intracellular staining as described in Example 3.
The results are shown in Table 6. The results demonstrate that immunization via SINCR
(DC+) replicon particles encoding for NS345a by a variety of routes results in CD8+ HCV
specific T cells which express IFN-γ.

Table 6: Immunization Routes and SINCR (DC+) Replicon Particles
(all mice VV NS5a challenged)

ICS for IFN-gamma (% CD8+ cells that are IFN-g+)

Vaccines              Immunization Route   Mean  StdevP  # of mice tested  % responding mice
SINCR (DC+) 5 x 10^6  100% IM (ta)         1.11  0.63    [illegible]       100%
[illegible]           100% S/C (BoT+FP)    0.62  0.29    [illegible]       100%
[illegible]           [illegible]          2.43  2.00    [illegible]       100%

EXAMPLE 7

SINCR (DC+) vs SINCR (LP) replicon particles encoding for NS345a

Alphavirus replicon particles, for example, SINCR (DC+) and SINCR (LP) were
prepared as described in Polo et al., Proc. Natl. Acad. Sci. USA (1999) 96:4598-4603. Mice
were immunized with 1 x 10^3 to 1 x 10^7 IU of SINCR (DC+) or SINCR (LP) replicon
particles encoding for NS345a, by intramuscular injection into the tibialis anterior, followed
by a booster injection of 10^7 pfu vaccinia virus encoding NS5a at 6 weeks.

IFN-γ expression was measured by intracellular staining as described in Example 3.
Increasing the number of SINCR (DC+) replicon particles encoding NS345a that were
administered resulted in an increase in the percentage of CD8+ T cells expressing IFN-γ.

EXAMPLE 8 

Alphavirus replicon priming, followed by various boosting regimes.

Alphavirus replicon particles, for example, SINCR (DC+) were prepared as
described in Polo et al., Proc. Natl. Acad. Sci. USA (1999) 96:4598-4603. Mice were
primed with SINCR (DC+), 1.5 x 10^6 IU replicon particles encoding NS345a, by
intramuscular injection into the tibialis anterior, followed by a booster of either 10-100 μg
of plasmid DNA encoding for NS5a, 10^10 adenovirus particles encoding NS345a, 1.5 x 10^6
IU SINCR (DC+) replicon particles encoding NS345a, or 10^7 pfu vaccinia virus encoding
NS5a at 6 weeks.

IFN-γ expression was measured by intracellular staining as described in Example 3.
The results are shown in Table 7. The results demonstrate that boosting with vaccinia virus
encoding NS5a DNA results in the strongest generation of CD8+ HCV specific T cells
which express IFN-γ. Boosting with plasmid encoding NS5a DNA also results in a good
response, while lesser responses are noted with adenovirus NS345a or SINCR (DC+) boosted
animals.

Table 7: Alphavirus Replicon Particle Priming, Followed by Various Boosting Regimens

ICS for IFN-gamma (% CD8+ cells that are IFN-g+)

Vaccines                Boost                       Mean  StdevP  # of mice tested  # of expts  % responding mice
SINCR (DC+) 1.5 x 10^6  NS5a DNA                    0.46  0.36    4                 1           75%
SINCR (DC+) 1.5 x 10^6  Adeno NS345a (10 x 10^10)   0.04  0.04    4                 1           25%
SINCR (DC+) 1.5 x 10^6  SINCR (DC+) 1.5 x 10^6      0.06  0.06    8                 2           25%
SINCR (DC+) 1.5 x 10^6  VV NS5a (1 x 10^7)          2.43  2.45    4                 1           100%

EXAMPLE 9 

Alphaviruses expressing NS345a 

Alphavirus replicon particles, for example, SINCR (DC+) and SINCR (LP), 
were prepared as described in Polo et al., Proc. Natl. Acad. Sci. USA (1999) 
96:4598-4603. Mice were immunized with 1 x 10^2 to 1 x 10^6 IU SINCR (DC+) 
replicons encoding NS345a via a combination of delivery routes (2/3 IM and 1/3 
S/C) as well as by S/C alone, or with 1 x 10^2 to 1 x 10^6 IU SINCR (LP) replicon 
particles encoding NS345a via a combination of delivery routes (2/3 IM and 1/3 S/C) 
as well as by S/C alone. 

The immunizations were followed by a booster injection of 10^7 pfu vaccinia virus 
encoding NS5a at 6 weeks. 

IFN-γ expression was measured by intracellular staining as described in 
Example 3. The results are shown in Figure 6. The results indicate activation of 
CD8+ HCV-specific T cells. 

EXAMPLE 10 

Efficiency of NS5a encoding DNA vaccine formulations to prime CTLs in mice 

Mice were immunized with either 10-100 µg of plasmid DNA encoding NS5a 
as described in Example 1 or with PLG-linked DNA encoding NS5a as described in 
Example 5. The immunizations were followed by a booster injection at 6 weeks of 
either 10-100 µg of plasmid DNA encoding NS5a, 10^10 adenovirus particles 
encoding NS345a, 1.5 x 10^6 IU SINCR (DC+) replicon particles encoding NS345a, 
or 10^7 pfu vaccinia virus encoding NS5a. 

CTL activity and IFN-γ expression were measured by the methods described 
in Examples 2 and 3. 

The results are shown in Table 8. The results demonstrate that priming with 
plasmid DNA encoding NS5a or PLG-linked DNA encoding NS5a results in 
activation of CD8+ HCV-specific T cells. 

EXAMPLE 11 

Efficiency of NS345b-encoding DNA vaccine formulations to prime CTLs in mice 

Mice were immunized with 10-100 µg of plasmid DNA encoding NS345b by 
intramuscular injection to the tibialis anterior or with PLG-linked DNA encoding 
NS5a as described in Example 5. The immunizations were followed by a booster 
injection of plasmid DNA encoding NS5a as described in Example 1. 

CTL activity and IFN-γ expression were measured by the methods described 
in Examples 2 and 3. 

The results are shown in Table 9. The results demonstrate that priming with 
plasmid DNA encoding NS345b or PLG-linked NS345b results in activation of 
CD8+ HCV-specific T cells. 
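
The "fold increase vs. 'naked' DNA" figure reported in Table 9 can be checked directly from the two group means given in that table (0.18% for plasmid DNA and 0.30% for PLG-linked DNA). The short Python sketch below performs that arithmetic; the function name `fold_increase` is illustrative and is not part of the original disclosure.

```python
def fold_increase(test_mean: float, reference_mean: float) -> float:
    """Return the ratio of a test group mean to a reference group mean."""
    if reference_mean <= 0:
        raise ValueError("reference mean must be positive")
    return test_mean / reference_mean


# Mean % of CD8+ cells that are IFN-gamma+, as reported in Table 9.
naked_dna_mean = 0.18   # NS345 plasmid DNA prime
plg_dna_mean = 0.30     # PLG-linked NS345 DNA prime

ratio = fold_increase(plg_dna_mean, naked_dna_mean)
print(f"{ratio:.2f}")  # prints 1.67
```

The result agrees with the 1.67-fold value listed for the PLG-linked formulation.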



Table 9: Efficiency of NS345b-Encoding DNA Vaccine Formulations to Prime CTLs in Mice 

ICS for IFN-gamma (% CD8+ cells that are IFN-g+) 

Vaccines         Boost       Mean   StdevP   Fold increase      CTL
                                             vs. 'naked' DNA    activity?
NS345 DNA        NS5a DNA    0.18   0.16     N/A                YES
PLG NS345 DNA    NS5a DNA    0.30   0.33     1.67               YES

(The "# of mice" and "% responding" columns are not legible in the source; one "% responding" entry reads 60%.) 

EXAMPLE 12 

Administration of DNA via separate plasmids 

Mice were immunized with 100 µg plasmid DNA encoding NS345a or 
with 100 µg PLG-linked DNA encoding NS345a. Additionally, separate DNA 
plasmids encoding NS5a, NS34a, and NS4ab (33.3 µg each) were administered 
concurrently to another group of mice. Finally, PLG-linked DNA encoding NS5a, 
NS34a, and NS4ab (33.3 µg each) was administered concurrently to another group 
of mice. The immunizations were followed by a booster injection of 1 x 10^7 pfu 
vaccinia virus encoding NS5a, 6 weeks after the first immunization. 

IFN-γ expression was measured by the method described in Example 3. The 
results are shown in Figure 7. The results demonstrate a particularly vigorous 
activation of CD8+ HCV-specific T cells when the DNA is divided 
into plasmids encoding smaller subunits and linked to PLG. 

EXAMPLE 13 

Immunogenicity of NS345Core121-ISCOMS in Mice 

Groups of 10 C57 black mice were immunized IM at 0, 21 and 60 days with 
the formulations shown in Table 10. The NS345Core121-PLGdss group received a 
vaccine dose of 50 µl in each leg whereas the other vaccine groups received a vaccine 
dose of 50 µl in one leg. 

NS345Core121-ISCOMS comprised amino acids 1242 to 3011 and 1-121 
of the HCV polyprotein, numbered relative to HCV-1, and were adsorbed to 
ISCOMS with a ratio of protein to QH of approximately 8:1, using standard 
techniques. See, e.g., International Publication No. WO 01/37869A. 

Core-ISCOMS, including an HCV core protein fragment from the region 
spanning amino acid positions 1-191 of the HCV polyprotein, numbered relative to 
HCV-1, with a ratio of protein to QH of 1:1, were produced using standard 
techniques. See, e.g., International Publication No. WO 01/37869A. 

NS345Core121 was formulated in 0.1% SDS in PBS and contained DTT. 
Protein was diluted in PBS and mixed 1:1 with MF59 (see, Ott et al., "MF59 -- 
Design and Evaluation of a Safe and Potent Adjuvant for Human Vaccines" in 
Vaccine Design: The Subunit and Adjuvant Approach (Powell, M.F. and Newman, 
M.J. eds.) Plenum Press, New York (1995) pp. 277-296; and U.S. Patent No. 
6,299,884) prior to immunization. 

For NS345Core121-PLGdss, PLG microparticles produced as described above 
were treated with 3-(trimethylsilyl)-1-propanesulfonic acid (DSS) to enhance 
adsorption of antigen. DSS is commercially available from, e.g., Sigma Chemical 
Co., St. Louis, MO. NS345Core121 was adsorbed thereto using standard techniques 
(see, International Publication No. WO 00/050006). The NS345Core121-PLGdss was 
mixed with MF59 prior to immunization. 

As shown in Table 10, NS345Core121-ISCOMS produced an antibody response 
only to NS5 in immunized C57 black mice. Higher levels of antibodies to NS5 were 
produced in mice immunized with NS345Core121 adjuvanted with MF59; however, no 
antibody response to core, NS3 or NS4 was produced with this adjuvant either. 

Mice immunized with Core-ISCOMS produced antibodies to core. In 
contrast, NS345Core121-PLGdss immunized mice produced significantly higher 
antibody titers to NS5 than mice immunized with NS345Core121-ISCOMS. In addition, 
NS345Core121-PLGdss immunized mice produced antibodies to NS3 and some antibody 
response to core, but no antibodies to NS4. 

Table 10. Immunogenicity of NS345Core121-ISCOMS in Mice. Geometric mean EIA 
antibody titers to core and nonstructural proteins are shown. The number of 
responding mice per group is also listed. 

Vaccine                     IM Protein         Anti-Core     Anti-C33C (NS3)   Anti-C100 (NS4)   Anti-NS5
                            Dose (µg) a        Antibody      Antibody          Antibody          Antibody
                                               EIA GMT       EIA GMT           EIA GMT           EIA GMT
NS345Core121 ISCOMS         6.0, 6.0, 6.0 b    <10 (0/10)    <10 (0/10)        <10 (0/10)        31 (7/10)
Core-ISCOMS                 6.0, 6.0, 6.0 c    188 (9/10)    <10 (1/10)        <10 (0/10)        <10 (2/10)
NS345Core121 MF59           6.0, 6.0, 6.0      <10 (2/9)     <10 (1/9)         <10 (0/9)         279 (9/)
NS345Core121 PLG-dss/MF59   10, 10, 10         5 (6/10)      50 (9/10)         <10 (2/10)        419 (9/9)

a Groups of 10 C57 black mice were immunized IM at 0, 21 and 60 days. Serum was 
obtained after the last immunization. The NS345Core121-PLGdss group received a vaccine 
dose of 50 µl in each leg whereas the other vaccine groups received a vaccine dose of 50 µl in 
one leg. 

b The ratio of protein to QH was approximately 8:1. 
c The ratio of protein to QH was approximately 1:1. 
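
The tables in these examples summarize EIA results as geometric mean titers (GMT) together with the number of responding animals. As a reminder of how such a summary is typically computed, the Python sketch below takes the geometric mean of per-animal titers and counts responders. The per-mouse titers, the responder cutoff of 10, and the convention of assigning sub-cutoff animals half the cutoff are all illustrative assumptions; the source reports only group summaries and does not state how sub-threshold titers were handled.

```python
import math
from typing import Sequence


def geometric_mean_titer(titers: Sequence[float], cutoff: float = 10.0) -> float:
    """Geometric mean of titers; values below `cutoff` count as cutoff / 2.

    The substitution rule is an assumed convention, not taken from the source.
    """
    if not titers:
        raise ValueError("at least one titer is required")
    adjusted = [t if t >= cutoff else cutoff / 2 for t in titers]
    return math.exp(sum(math.log(t) for t in adjusted) / len(adjusted))


def responders(titers: Sequence[float], cutoff: float = 10.0) -> int:
    """Number of animals whose titer reaches the cutoff."""
    return sum(1 for t in titers if t >= cutoff)


# Hypothetical per-mouse titers for one group of 10 animals (not source data).
example = [40, 80, 20, 0, 160, 10, 0, 40, 320, 0]
print(f"GMT = {geometric_mean_titer(example):.0f} "
      f"({responders(example)}/{len(example)})")
```

Under a convention like this, a group can have a GMT below the cutoff even when several animals respond, which is one way to read an entry such as "5 (6/10)".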

EXAMPLE 14 

Immunogenicity of Different Formulations of NS345Core121 or NS345 in 
Mice 

Groups of 10 C57 black mice were immunized IM at 0, 30 and 60 days with 
the formulations shown in Tables 11 and 12. For the studies shown in Table 11, the 
NS345 and NS345Core121 protein concentration was 10 µg per dose, and for those in 
Table 12, the concentration was 5 µg per dose. 

For PLG-NS345 (amino acids 1242 to 3011 of the HCV polyprotein) and 
PLG-NS345Core121 (amino acids 1242-3011 and 1-121 of the HCV polyprotein), 
PLG microparticles were prepared and NS345 or NS345Core121 was adsorbed 
thereto using standard techniques, as described above. 
For PLG-NS345 + PLG-CTAB-E1E2 DNA, PLG microparticles were 
prepared and NS345 was adsorbed to the microparticles as described above. E1E2 
DNA was produced as follows. Mammalian expression plasmid pMH-E1E2-809 
(Figure 4, ATCC Deposit No. PTA-3643) encodes an E1E2 fusion protein which 
includes amino acids 192-809 of HCV 1a (see, Choo et al., Proc. Natl. Acad. Sci. 
USA (1991) 88:2451-2455). Chinese Hamster Ovary (CHO) cells were used for 
expression of the HCV E1E2 sequence from pMH-E1E2-809. In particular, CHO 
DG44 cells were used. These cells, described by Urlaub et al., Proc. Natl. Acad. Sci. 
USA (1980) 77:4216-4220, were derived from CHO K-1 cells and were made 
dihydrofolate reductase (dhfr) deficient by virtue of a double deletion in the dhfr 
gene. DG44 cells were transfected with pMH-E1E2-809. The transfected cells were 
grown in selective medium such that only those cells expressing the dhfr gene could 
grow (Sambrook et al., supra). Isolated CHO colonies were picked (~800 colonies) 
into individual wells of a 96-well plate. From the original 96-well plates, replicates 
were made to perform expression experiments. The replicate plates were grown until 
the cells made a confluent monolayer. The cells were fixed to the wells of the plate 
and permeabilized using cold methanol. Anti-E1 and anti-E2 antibodies, 3D5C3 and 
3E5-1 respectively, were used to probe the fixed cells. After adding an anti-mouse 
HRP conjugate, followed by substrate, the cell lines with the highest expression were 
determined. The highest expressing cell lines were then expanded to 24-well cluster 
plates. The assay for expression was repeated, and again, the highest expressing cell 
lines were expanded to wells of greater volume. This was repeated until the highest 
expressing cell lines were expanded from 6-well plates into tissue culture flasks. At 
this point there was a sufficient quantity of cells to allow accurate count and harvest of 
the cells, and quantitative expression assays were done. An ELISA was performed 
on the cell extract to determine high expressors. 

To produce the PLG-CTAB-E1E2 DNA, PLG microparticles were treated 
with CTAB as described above (see, International Publication No. WO 00/050006). 

For PLG-NS345Core121 + E1E2 DNA, PLG-NS345Core121 and E1E2 DNA 
were produced as described above. 

For PLG-NS345 or PLG-NS345Core121 + MF59, PLG-NS345 or 
PLG-NS345Core121 was combined with MF59 as described above. 

For PLG-NS345 or PLG-NS345Core121 + CTAB-CpG, NS345 or 
NS345Core121 was adsorbed to PLG as described above. The CpG molecule used 
was 5'-TCCATGACGTTCCTGACGTT-3' and this was treated with CTAB, as 
described above. 
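
As a quick sanity check on the oligonucleotide sequence quoted above, the following Python sketch reports its length and counts the CG dinucleotides it contains. It inspects only the sequence string given in the text; the helper name `count_cpg` is illustrative.

```python
def count_cpg(sequence: str) -> int:
    """Count CG dinucleotides (read 5' to 3') in a DNA sequence."""
    seq = sequence.upper()
    return sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == "CG")


# CpG oligonucleotide quoted in the text, written 5' to 3'.
oligo = "TCCATGACGTTCCTGACGTT"

print(len(oligo))        # prints 20
print(count_cpg(oligo))  # prints 2
```

Both CG dinucleotides sit inside a GACGTT hexamer, which occurs twice in the 20-mer.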

For PLG-NS345 or PLG-NS345Core121 + QS21, the saponin adjuvant QS21 
was combined with the PLG-HCV proteins. 

For PLG-NS345 or PLG-NS345Core121 + CTAB-CpG + MF59, the various 
components, as described above, were combined. 

The remaining adjuvants used in the studies and shown in the tables are 
self-explanatory. 

The results of these studies are shown in Tables 11 and 12. As can be seen in 
Table 11, none of the formulations produced antibody responses to core, NS3 or NS4 
antigens. However, PLG-NS345 + CTAB-CpG in MF59 produced the highest 
antibody titers to NS5 (GMT 1471). PLG-NS345Core121 + QS21, PLG-NS345 + CTAB-CpG, 
PLG-NS345Core121 + CTAB-CpG, and PLG-NS345 + QS21 produced moderate antibody 
titers to NS5 (GMT 201 to 505). The other formulations produced very low antibody 
titers to NS5. 

As can be seen in Table 12, NS345Core121/MF59/MPL and 
NS345Core121/MF59/CpG formulations produced very high antibody titers to 
NS345Core121. NS345Core121/MF59, NS345/MF59/CpG, and 
NS345Core121/MF59/Chol/QS21 formulations produced moderate antibody titers to 
NS345Core121. The other formulations produced very low or no antibody titers to 
NS345Core121. 

Table 11. Immunogenicity of different formulations of HCV NS345Core121 or NS345 in 
Mice. Geometric mean EIA antibody titers to core and nonstructural proteins are shown. 

Vaccine a                          Anti-Core   Anti-C33C (NS3)   Anti-C100 (NS4)   Anti-NS5
                                   Antibody    Antibody          Antibody          Antibody
                                   EIA GMT     EIA GMT           EIA GMT           EIA GMT
PLG-NS345                          <10         <10               <10               10
PLG-NS345Core121                   <10         <10               <10               15
PLG-NS345 + PLG-CTAB-E1E2 DNA      <10         11                <10               23
PLG-NS345Core121 + E1E2 DNA        <10         <10               <10               20
PLG-NS345 + MF59                   <10         <10               <10               70
PLG-NS345Core121 + MF59            <10         <10               <10               26
PLG-NS345 + CTAB-CpG               <10         <10               <10               350
PLG-NS345Core121 + CTAB-CpG        <10         <10               <10               271
PLG-NS345 + QS21                   <10         <10               <10               201
PLG-NS345Core121 + QS21            <10         <10               <10               505
PLG-NS345 + CTAB-CpG + MF59        <10         <10               <10               1471
PLG-NS345Core121 + CTAB + MF59     <10         <10               <10               63

a Groups of 10 C57 black mice were immunized IM at 0, 30 and 60 days. Serum was 
obtained after the last immunization. The NS345 or NS345Core121 protein concentration 
was 10 µg per dose. 

Table 12. Immunogenicity of different formulations of HCV NS345Core121 or HCV 
NS345 in Mice. Geometric mean EIA antibody titers to NS345Core121 protein are shown. 

Vaccine a                             Anti-NS345Core121 Antibody EIA GMT
NS345Core121/MF59                     328
NS345Core121/MF59/CpG                 7,926
NS34a+NS5B+Core/MF59                  12
NS34a+NS5B+Core/MF59/CpG              5
PLG-NS345Core121/MF59                 <10
PLG-NS345Core121/MF59/CpG             <10
PLG-NS345Core121/PLG-CpG              9
NS345Core121/alum phosphate           34
NS345Core121/alum phosphate/CpG       950
NS345/MF59/CpG                        511
PLG-NS345/PLG/CpG                     117
NS345Core121/MF59/MPL                 10,292
NS345Core121/MF59/Chol/QS21           698
NS345Core121/Alum phosphate/MPL       23

a Groups of 10 C57 black mice were immunized IM at 0, 30 and 60 days. Serum was 
obtained after the last immunization. The NS345 or NS345Core121 protein concentration was 
5 µg per dose. 

EXAMPLE 15 

Lymphoproliferative Response of Different Formulations of NS345Core121 or 
NS34A + NS5B + Core in Mice 

Groups of 8 C57 black mice were immunized IM at 0, 30 and 60 days with 
the formulations shown in Table 13, prepared as described above. Spleens were 
obtained after the last immunization. The NS345Core121 protein concentration was 
25 µg per dose. The NS34a, NS5b and core doses were 3 µg each. 

The results of this study are shown in Table 13. As can be seen, 
NS345Core121/Alum/CpG, PLG-NS345Core121/PLG/CpG, NS34a+NS5B+Core/MF59/CpG 
and PLG-NS345Core121/MF59/CpG formulations demonstrated 
strong LPA responses to NS5, NS34 and core antigens. The NS345Core121/MF59 
formulation also produced a strong LPA response to NS5 and NS34; core was not 
tested. 

Moderate LPA responses were observed to NS5, NS34 and core antigens with 
PLG-NS345Core121/MF59 and NS34a+NS5B+Core/MF59 formulations. The 
NS345Core121/MF59/CpG formulation may not have been administered properly in 
that no LPA response was observed in this experiment. In a subsequent experiment, 
as shown in Table 14, an LPA response was observed to this formulation. 

Groups of 8 C57 black mice were immunized once IM with the formulations 
shown in Table 14, produced as described above. Draining lymph nodes were 
obtained. 

The NS345Core121 protein concentration was 25 µg per dose. 

The results of this study are shown in Table 14. As can be seen in Table 14, 
all the formulations tested produced a strong LPA response to NS5, NS34 and core 
as well as to NS345Core121. 

Table 13. Lymphoproliferative response of different formulations of HCV NS345Core121 
or NS34A+NS5B+Core in Mice. LPA responses (cpm) to core and nonstructural proteins 
are shown. The number of mice in each group responding is also indicated in 
parentheses. 

Vaccine a                       SOD-NS5       SOD-C200      SOD-C22-3     HIV-2 env
                                                            (Core)        (background control)
NS345Core121/MF59               2250 (6/8)    1800 (4/8)    ND            144
NS345Core121/MF59/CpG           80            80            ND            138
PLG-NS345Core121/MF59           560 (2/8)     120 (2/8)     510 (2/8)     93
PLG-NS345Core121/MF59/CpG       1600 (6/8)    1500 (6/8)    620 (8/8)     75
NS34a+NS5B+Core/MF59            564 (8/8)     710 (8/8)     265 (8/8)     76
NS34a+NS5B+Core/MF59/CpG        1523 (8/8)    885 (8/8)     446 (6/8)     67
PLG-NS345Core121/PLG/CpG        3675 (8/8)    2860 (8/8)    370 (8/8)     88
NS345Core121/Alum/CpG           8450 (8/8)    7940 (8/8)    1040 (6/8)    82

a Groups of 8 C57 black mice were immunized IM at 0, 30 and 60 days. Spleens were 
obtained after the last immunization. The NS345Core121 protein concentration was 25 µg 
per dose. The NS34a, NS5B and Core doses were 3 µg each. 
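
One common way to read the counts in Table 13 is as a ratio of antigen-stimulated cpm to the background-control cpm (a stimulation index). The source reports raw cpm in this table and does not say how a positive response was scored, so the ratio below is an illustrative assumption rather than the authors' scoring rule. The values are read from the NS345Core121/Alum/CpG and NS345Core121/MF59/CpG rows.

```python
def stimulation_index(antigen_cpm: float, background_cpm: float) -> float:
    """Ratio of antigen-stimulated counts to background counts."""
    if background_cpm <= 0:
        raise ValueError("background cpm must be positive")
    return antigen_cpm / background_cpm


# (SOD-NS5 cpm, HIV-2 env background cpm) pairs read from Table 13.
rows = {
    "NS345Core121/Alum/CpG": (8450, 82),
    "NS345Core121/MF59/CpG": (80, 138),
}

for name, (ns5_cpm, background_cpm) in rows.items():
    si = stimulation_index(ns5_cpm, background_cpm)
    print(f"{name}: SI = {si:.1f}")
```

On this reading the Alum/CpG group is roughly a hundredfold above background, while the MF59/CpG group falls below it, consistent with the text's remark that no LPA response was observed for that formulation in this experiment.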

EXAMPLE 16 

Immunogenicity of Recombinant HCV Protein Vaccines Adjuvanted with 
ISCOMS in Rhesus Macaques 

The safety and immunogenicity of HCV proteins complexed with the 
adjuvant, Iscomatrix, was studied in Rhesus macaques. Three groups made up of 
four animals each were immunized IM as detailed below at weeks 0, 4 and 8. 
Vaccines were prepared as described above. The ISCOMS used lacked QH-A. 



Group    n   Vaccine                               Delivery
Number
1        4   Core-ISCOM (50 µg in 1 ml)            0.5 ml R Leg
                                                   0.5 ml L Leg
2        4   NS345Core121-ISCOM (1 mg in 1 ml)     0.5 ml R Leg
                                                   0.5 ml L Leg
3        4   Core-ISCOM (25 µg in 0.5 ml)          0.5 ml Core-ISCOM R Leg
             and NS5b-ISCOM (50 µg in 0.35 ml)     0.35 ml NS5b-ISCOM L Leg


Bleeds occurred as follows and immunogenicity was determined by CTL assays, 
lymphoproliferation assays, FACS analysis and antibody response as previously 
described (Polakos, et al. (2001) J. of Immunology 166:3589). 

Week   Bleed date   Immunized
-10
-1     X
0                   X
2      X
4                   X
6      X
8                   X
10     X

The immunogenicity of the different HCV recombinant protein vaccines is 
shown in Tables 15-17. 

Table 15. The Immunogenicity of HCV Core-ISCOMS vaccine two weeks post 2nd 
immunization and post 3rd immunization as assessed by CTL assays, CD8+ FACS 
analysis, LPA stimulation index and CD4+ FACS analysis 

2 weeks post 3rd 

Macaque #   CD8+ ICS (CTL)                                    CD4+ ICS (LPA SI)
            C         NS3       NS4       NS5a      NS5b      C         NS3       NS4       NS5a      NS5b
X020        -(-)                                              +(-)
N001                                                          +(-)
N086        -(-)                                              +(-)
X010        -(-)                                              +(11)

2 weeks post 2nd 

Macaque #   CD8+ ICS (CTL)                                    CD4+ ICS (LPA SI)
            C         NS3       NS4       NS5a      NS5b      C         NS3       NS4       NS5a      NS5b
X020        -(-)                                              +/-(-)
N001        -(-)                                              -(8)
N086        -(-)                                              +(-)
X010        -(-)                                              +/-(12)
Table 16. The Immunogenicity of HCV NS345Core121-ISCOMS vaccine two weeks post 
2nd immunization and post 3rd immunization as assessed by CTL assays, CD8+ FACS 
analysis, LPA stimulation index and CD4+ FACS analysis 

2 weeks post 3rd 

Macaque #   CD8+ ICS (CTL)                                    CD4+ ICS (LPA SI)
            C         NS3       NS4       NS5a      NS5b      C         NS3       NS4       NS5a      NS5b
X016        -(-)      +(+)      -(-)      -(-)      -(-)      -(-)      +(-)      -(-)      +/-(-)    +/-(-)
X008        -(-)      -(-)      -(-)      -(-)      -(-)      -(-)      -(-)      -(-)      -(5)      -(-)
X021        -(-)      -(-)      -(-)      -(-)      -(-)      -(-)      -(-)      -(-)      -(-)      -(-)
X023        +/-(-)    -(-)      -(-)      +/-(-)    -(-)      -(-)      +/-(-)    -(-)      -(-)      -(-)

2 weeks post 2nd 

Macaque #   CD8+ ICS (CTL)                                    CD4+ ICS (LPA SI)
            C         NS3       NS4       NS5a      NS5b      C         NS3       NS4       NS5a      NS5b
X016        -(-)      +(+)      -(-)      +(+)      +(+)      -(-)      +(-)      +(-)      +(-)      +(-)
X008        +(-)      +(+)      -(-)      +(+)      +(+)      -(-)      +(-)      -(-)      +/-(-)    +(-)
X021        -(-)      -(-)      +/-(-)    -(-)      +/-(-)    -(-)      -(-)      +/-(-)    -(-)      -(-)
X023        +/-(+)    +(+)                -(-)      -(-)      -(-)      +(-)      -(-)      -(-)      ?(7)
Table 17. The Immunogenicity of HCV Core-ISCOMS + NS5b-ISCOMS vaccine two 
weeks post 2nd immunization and post 3rd immunization as assessed by CTL assays, CD8+ 
FACS analysis, LPA stimulation index and CD4+ FACS analysis 

2 weeks post 3rd 

Macaque #   CD8+ ICS (CTL)                                    CD4+ ICS (LPA SI)
            C         NS3       NS4       NS5a      NS5b      C         NS3       NS4       NS5a      NS5b
X022        -(-)                                    -(-)      -(8)                                    +/-(11)
X014        -(-)                                    -(-)      -(6)                                    +(11)
N154        -(-)                                    -(-)      -(-)                                    +(-)
N173        -(-)                                    -(-)      -(-)                                    +/-(-)

2 weeks post 2nd 

Macaque #   CD8+ ICS (CTL)                                    CD4+ ICS (LPA SI)
            C         NS3       NS4       NS5a      NS5b      C         NS3       NS4       NS5a      NS5b
X022        -(-)                                    -(+)      -(-)                                    -(-)
X014        +(-)                                    +/-(-)    -(-)                                    +(-)
N154        -(-)                                    -(-)      -(6)                                    +(8)
N173        -(-)                                    -(-)      +/-(-)                                  +(6)

As can be seen in Table 15, the HCV Core-ISCOM vaccine produced no CTL 
positive responses in any of the 4 immunized macaques after the second or third 
immunizations. Nor was any positive CD8 γ-interferon and/or TNF-α intracellular 
staining observed, although backgrounds were high in these particular assays. At 
least two of four macaques produced a strong LPA response after the second 
immunization, but only one remained positive after the third immunization. Two of 
four macaques produced positive CD4 intracellular staining after the second 
immunization and four of four after the third immunization. 

As shown in Table 16, the HCV NS345Core121-ISCOM vaccine after the 
second immunization produced CTL positive responses to peptide pools representing 
two or more HCV proteins in three of four macaques (two of these macaques had 
responses to peptide pools from NS3, NS5a and NS5b, one to peptide pools from 
core and NS3). CD8 positive γ-interferon and/or TNF-α intracellular staining to 
peptide pools representing two or more HCV proteins was positive in at least three of 
four macaques. One of four macaques produced a strong LPA response. At least 
three of four macaques produced CD4 positive intracellular staining to two or more 
HCV proteins. After the third immunization, only one of four macaques had a 
positive CTL response, CD8 positive intracellular staining and CD4 positive 
intracellular staining. One other macaque had a positive LPA response and weak 
CD8 and CD4 intracellular staining. This decline in immunogenicity was likely due to 
instability of the vaccine formulation (see below). 

As shown in Table 17, the HCV Core-ISCOM + NS5b-ISCOM vaccine 
produced a CTL positive response to NS5b in one of the 4 immunized macaques 
after the second immunization, which did not remain positive after the third 
immunization. CD8 positive intracellular staining was observed in one of 
four animals after the second immunization. Two of four macaques produced a strong 
LPA response after the second immunization, which did not remain positive after the third 
immunization. Two other macaques did develop a strong LPA response after the 
third immunization. Three of four developed positive CD4 intracellular staining. 
One developed positive CD8 intracellular staining. 

Three weeks after the third immunization, it was noted that the physical 
appearance of the polyprotein vaccine solution was visibly turbid. The core vaccine 
also was turbid but less so. The Core-NS5 vaccine was also slightly turbid. Analysis 
of this turbidity in the polyprotein formulation indicated that the ISCOM particles 
had precipitated into large aggregates. These aggregates could be dispersed by 
vortexing with 0.1% TWEEN 80 detergent. It is probable that this change in the 
formulation of the vaccine occurred before the last immunization. This observed 
change in appearance of the vaccines may have affected their immunogenicity, as 
cellular immune results declined in all three vaccines. 

The immunogenicity of HCV Core-ISCOMS, NS345Core121-ISCOMS and 
Core-ISCOMS + NS5b-ISCOMS as assessed by EIA antibody response is shown in 
Table 18. As can be seen, all three vaccines produced an antibody response by the 
third immunization to their corresponding HCV proteins, except for the 
NS345Core121-ISCOM vaccine. The NS345Core121-ISCOM vaccine produced 
antibody responses to NS3, NS4 and a very strong antibody response to NS5, but no 
antibody response to HCV core. 
Table 18. The immunogenicity of HCV Core-ISCOMS, NS345Core121-ISCOMS, 
Core-ISCOMS + NS5b-ISCOMS vaccine two weeks post 2nd immunization and post 3rd 
immunization as assessed by EIA antibody response to HCV proteins. 

Vaccine               Anti-Core EIA        Anti-NS3 EIA         Anti-NS4 EIA         Anti-NS5 EIA
Macaque #             Antibody Titer       Antibody Titer       Antibody Titer       Antibody Titer
                      Post 2nd  Post 3rd   Post 2nd  Post 3rd   Post 2nd  Post 3rd   Post 2nd  Post 3rd

Core-ISCOM
  X020                66        226
  N001                87        46
  N086                363       396
  X010                108       137

NS345Core121-ISCOM
  X016                <10       <10        <10       554        56        68         3,590     3,405
  X008                <10       <10        66        995        14        44         2,109     3,213
  X021                <10       <10        128       6,330      41        204        7,213     8,083
  X023                <10       <10        <10       3,910      64        64         1,243     4,704

Core-ISCOM + NS5b-ISCOM
  X022                <10       18                                                   <10       134
  X014                <10       13                                                   <10       693
  N154                542       554                                                  <10       272
  N173                28        78                                                   <10       258

EXAMPLE 17 

Immunization of Chimpanzees with Recombinant HCV Protein and DNA 
Vaccines 

Five groups of five chimps each were immunized IM at 0, 0.7, 2 and 5 months 
with the formulations presented below. Blood was collected at week 0, two weeks 
after the second immunization, two weeks after the third immunization 
and two weeks after the fourth immunization. 

Formulation 1: 20 µg E1E2 polypeptide + MF59 + 500 CpG (produced as 
described above); 

Formulation 2: 1 mg NS345Core121-ISCOM (produced as described above); 

Formulation 3: 6 mg each of CTAB-PLG-E1E2 (bp 574-2427, encoding 
amino acids 192-809 of the HCV polyprotein, numbered relative to HCV-1); 
CTAB-PLG-NS34a (bp 3079-5133, encoding amino acids 1027-1711 of the HCV 
polyprotein, numbered relative to HCV-1); CTAB-PLG-NS34ab (bp 4972-5916, 
encoding amino acids 1658-1972 of the HCV polyprotein, numbered relative to 
HCV-1); CTAB-PLG-NS5a (bp 5917-7260, encoding amino acids 1973-2420 of the 
HCV polyprotein, numbered relative to HCV-1); 

Formulation 4: 6 mg each of E1E2 DNA, NS34a DNA, NS34ab DNA and 
NS5a DNA, having the same coordinates as described above, delivered without PLG 
via electroporation (see, e.g., U.S. Patent Nos. 6,132,419; 6,451,002; 6,418,341; 
6,233,483; U.S. Patent Publication No. 2002/0146831; and International Publication 
No. WO/0045823, for this delivery technique). Results are shown in Figures 8-10. 

As can be seen in Figure 8, all vaccines were capable of priming CD4+ and 
CD8+ cells specific to HCV. Thus, all vaccines were successful at inducing a T cell 
response to HCV. Determination of the results for the PLG-DNA from formulation 3 
at two weeks subsequent to the fourth vaccination is in progress. 

As shown in Figures 9 and 10, multiple T cell specificities were induced by 
the two vaccines. Both vaccines primed T cells specific for multiple T cell epitopes. 

As can be seen in Tables 19 and 20, E1E2 adjuvanted with MF59 primed 
anti-E1E2 titers. CpG enhanced anti-E1E2 responses as well as TH1 responses, and 
the ISCOM and the two DNA vaccines were capable of priming CD4+ and CD8+ T 
cell responses to HCV. 

Thus, HCV polypeptides and polynucleotides, either alone or as fusions, to 
stimulate cell-mediated immune responses, are disclosed. Although preferred 
embodiments of the subject invention have been described in some detail, it is 
understood that obvious variations can be made without departing from the spirit and 
the scope of the invention as defined by the appended claims. 

We claim: 

1 . A fusion protein comprising HCV polypeptides, wherein the HCV 
polypeptides consist essentially of an NS3, an NS4, anNS5 and a core polypeptide of 

5 a hepatitis C virus (HCV), wherein said core polypeptide consists of amino acids 1 - 
121 of the HCV polyprotein, numbered relative to the full-length HCV-1 polyprotein. 

2. The fusion protein of claim 1, wherein the NS5 polypeptide is an 
NS5a 

10 polypeptide. 

3. The fusion protein of claim 1 , wherein the NS5 polypeptide is an 
NS5b 

polypeptide. 

15 

4. The fusion protein of claim 1 , wherein the NS5 polypeptide is an 
NS5a 

and an NS5b polypeptide. 

20 6. The fusion protein of claim 4, wherein the protein comprises the 

sequence 
of amino acids of SEQ ID NO:8. 

7. A fusion protein according to any of claims 1-6, wherein at least one 
25 of the HCV polypeptides is derived from a different strain of HCV than the other 

HCV polypeptides. 

8. A composition comprising: 

(a) a fusion protein according to any of claims 1-7; and 
30 (b) a pharmaceutically acceptable excipient. 
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9. The composition of claim 8, further comprising an adjuvant. 

10. The composition of claim 8, further comprising a CpG 
oligonucleotide. 

1 1 . The composition of claim 8, wherein said fusion protein is adsorbed to 
or 

entrapped within a microparticle or ISCOM. 



10 12. The composition of claim 8, further comprising a polynucleotide 

encoding an HCV E1E2 complex. 

13. An isolated and purified polynucleotide that encodes a fusion protein 
according to any of claims 1-7. 

15 

' 14. A composition comprising: 

(a) the isolated and purified polynucleotide of claim 13; and 

(b) a pharmaceutically acceptable excipient. 

15. The composition of claim 14, further comprising an adjuvant.

16. The composition of claim 14, wherein said polynucleotide is adsorbed to
or entrapped within a microparticle.

17. The composition of claim 14, further comprising a polynucleotide
encoding an HCV E1E2 complex.

18. A method of activating T cells of a vertebrate subject which recognize
an epitope of an HCV polypeptide, comprising the step of:

administering the composition of any of claims 8-12 to said vertebrate
subject, whereby a population of activated T cells recognizes an epitope of the NS3,
NS4, NS5 and/or core polypeptides.

19. A method of activating T cells of a vertebrate subject which recognize
an epitope of an HCV polypeptide, comprising the step of:

administering the composition of any of claims 14-17 to said vertebrate
subject, whereby a population of activated T cells recognizes an epitope of the NS3,
NS4, NS5 and/or core polypeptides.

20. The method of claim 19, wherein the polynucleotide is administered via
electroporation.

21. Use of a composition according to any of claims 8-12 and 14-17 for
activating T cells of a vertebrate subject which recognize an epitope of an HCV
polypeptide, wherein a population of activated T cells recognizes an epitope of the
NS3, NS4, NS5 and/or core polypeptides.

22. Use of a fusion protein according to any of claims 1-7 for the
manufacture of a medicament for activating T cells of a vertebrate subject which
recognize an epitope of an HCV polypeptide, wherein a population of activated T
cells recognizes an epitope of the NS3, NS4, NS5 and/or core polypeptides.

23. Use of a polynucleotide according to claim 13 for the manufacture of a
medicament for activating T cells of a vertebrate subject which recognize an epitope
of an HCV polypeptide, wherein a population of activated T cells recognizes an
epitope of the NS3, NS4, NS5 and/or core polypeptides.



1 10
MAPITAYAQQ
ATG GCG CCC ATC ACG GCG TAC GCC CAG CAG

20
TRGLLGCIITSLTGR
ACA AGG GGC CTC CTA GGG TGC ATA ATC ACC AGC CTA ACT GGC CGG

30 40
DKNQVEGEVQIVSTA
GAC AAA AAC CAA GTG GAG GGT GAG GTC CAG ATT GTG TCA ACT GCT

50
AQTFLATCINGVCWT
GCC CAA ACC TTC CTG GCA ACG TGC ATC AAT GGG GTG TGC TGG ACT

60 70
VYHGAGTRTIASPKG
GTC TAC CAC GGG GCC GGA ACG AGG ACC ATC GCG TCA CCC AAG GGT

80
PVIQMYTNVDQDLVG
CCT GTC ATC CAG ATG TAT ACC AAT GTA GAC CAA GAC CTT GTC GGC

90 100
WPAPQGSRSLTPCTC
TGG CCC GCT CCG CAA GGT AGC CGA TCA TTG ACA CCC TGC ACT TGC

110
GSSDLYLVTRHADVI
GGC TCC TCG GAC CTT TAC CTG GTC ACG AGG CAC GCC GAT GTC ATT

120 130
PVRRRGDSRGSLLSP
CCC GTG CGC CGG CGG GGT GAT AGC AGG GGC AGC CTG CTG TCG CCC

140
RPISYLKGSSGGPLL
CGG CCC ATT TCC TAC TTG AAA GGC TCC TCG GGG GGT CCG CTG TTG

150 160
CPAGHAVGIFRAAVC
TGC CCC GCG GGG CAC GCC GTG GGC ATA TTT AGG GCC GCG GTG TGC

170
TRGVAKAVDFIPVEN
ACC CGT GGA GTG GCT AAG GCG GTG GAC TTT ATC CCT GTG GAG AAC

180
LETTMRS
CTA GAG ACA ACC ATG AGG TCC

[Figure: nucleotide sequence (both strands) and deduced amino acid sequence of the HCV-1 E1/E2/p7 region, with the starts of mature E1, E2 and p7 marked; residue numbering ends at amino acid 809. See SEQ ID NOS:3 and 4.]

MAAYAAQGYK
ATG GCT GCA TAT GCA GCT CAG GGC TAT AAG

20
VLVLNPSVAATLGFG
GTG CTA GTA CTC AAC CCC TCT GTT GCT GCA ACA CTG GGC TTT GGT

30 40
AYMSKAHGIDPNIRT
GCT TAC ATG TCC AAG GCT CAT GGG ATC GAT CCT AAC ATC AGG ACC

50
GVRTITTGSPITYST
GGG GTG AGA ACA ATT ACC ACT GGC AGC CCC ATC ACG TAC TCC ACC

60 70
YGKFLADGGCSGGAY
TAC GGC AAG TTC CTT GCC GAC GGC GGG TGC TCG GGG GGC GCT TAT

80
DIIICDECHSTDATS
GAC ATA ATA ATT TGT GAC GAG TGC CAC TCC ACG GAT GCC ACA TCC

90 100
ILGIGTVLDQAETAG
ATC TTG GGC ATT GGC ACT GTC CTT GAC CAA GCA GAG ACT GCG GGG

110
ARLVVLATATPPGSV
GCG AGA CTG GTT GTG CTC GCC ACC GCC ACC CCT CCG GGC TCC GTC

120 130
TVPHPNIEEVALSTT
ACT GTG CCC CAT CCC AAC ATC GAG GAG GTT GCT CTG TCC ACC ACC

140
GEIPFYGKAIPLEVI
GGA GAG ATC CCT TTT TAC GGC AAG GCT ATC CCC CTC GAA GTA ATC

150 160
KGGRHLIFCHSKKKC
AAG GGG GGG AGA CAT CTC ATC TTC TGT CAT TCA AAG AAG AAG TGC

170
DELAAKLVALGINAV
GAC GAA CTC GCC GCA AAG CTG GTC GCA TTG GGC ATC AAT GCC GTG

180 190
AYYRGLDVSVIPTSG
GCC TAC TAC CGC GGT CTT GAC GTG TCC GTC ATC CCG ACC AGC GGC

200

DVVVVATDALMTGYT
GAT GTT GTC GTC GTG GCA ACC GAT GCC CTC ATG ACC GGC TAT ACC

210 220
GDFDSVIDCNTCVTQ
GGC GAC TTC GAC TCG GTG ATA GAC TGC AAT ACG TGT GTC ACC CAG

230
TVDFSLDPTFTIETI
ACA GTC GAT TTC AGC CTT GAC CCT ACC TTC ACC ATT GAG ACA ATC

240 250
TLPQDAVSRTQRRGR
ACG CTC CCC CAA GAT GCT GTC TCC CGC ACT CAA CGT CGG GGC AGG

260
TGRGKPGIYRFVAPG
ACT GGC AGG GGG AAG CCA GGC ATC TAC AGA TTT GTG GCA CCG GGG

270 280
ERPSGMFDSSVLCEC
GAG CGC CCC TCC GGC ATG TTC GAC TCG TCC GTC CTC TGT GAG TGC

290
YDAGCAWYELTPAET
TAT GAC GCA GGC TGT GCT TGG TAT GAG CTC ACG CCC GCC GAG ACT

300 310
TVRLRAYMNTPGLPV
ACA GTT AGG CTA CGA GCG TAC ATG AAC ACC CCG GGG CTT CCC GTG

320
CQDHLEFWEGVFTGL
TGC CAG GAC CAT CTT GAA TTT TGG GAG GGC GTC TTT ACA GGC CTC

330 340
THIDAHFLSQTKQSG
ACT CAT ATA GAT GCC CAC TTT CTA TCC CAG ACA AAG CAG AGT GGG

350
ENLPYLVAYQATVCA
GAG AAC CTT CCT TAC CTG GTA GCG TAC CAA GCC ACC GTG TGC GCT

360 370
RAQAPPPSWDQMWKC
AGG GCT CAA GCC CCT CCC CCA TCG TGG GAC CAG ATG TGG AAG TGT

380
LIRLKPTLHGPTPLL
TTG ATT CGC CTC AAG CCC ACC CTC CAT GGG CCA ACA CCC CTG CTA

390 400
YRLGAVQNEITLTHP
TAC AGA CTG GGC GCT GTT CAG AAT GAA ATC ACC CTG ACG CAC CCA

410
VTKYIMTCMSADLEV
GTC ACC AAA TAC ATC ATG ACA TGC ATG TCG GCC GAC CTG GAG GTC

420 430
VTSTWVLVGGVLAAL
GTC ACG AGC ACC TGG GTG CTC GTT GGC GGC GTC CTG GCT GCT TTG

440
AAYCLSTGCVVIVGR
GCC GCG TAT TGC CTG TCA ACA GGC TGC GTG GTC ATA GTG GGC AGG

450 460
VVLSGKPAIIPDREV
GTC GTC TTG TCC GGG AAG CCG GCA ATC ATA CCT GAC AGG GAA GTC

470
LYREFDEMEECSQHL
CTC TAC CGA GAG TTC GAT GAG ATG GAA GAG TGC TCT CAG CAC TTA

480 490
PYIEQGMMLAEQFKQ
CCG TAC ATC GAG CAA GGG ATG ATG CTC GCC GAG CAG TTC AAG CAG

500
KALGLLQTASRQAEV
AAG GCC CTC GGC CTC CTG CAG ACC GCG TCC CGT CAG GCA GAG GTT

510 520
IAPAVQTNWQKLETF
ATC GCC CCT GCT GTC CAG ACC AAC TGG CAA AAA CTC GAG ACC TTC

530
WAKHMWNFISGIQYL
TGG GCG AAG CAT ATG TGG AAC TTC ATC AGT GGG ATA CAA TAC TTG

540 550
AGLSTLPGNPAIASL
GCG GGC TTG TCA ACG CTG CCT GGT AAC CCC GCC ATT GCT TCA TTG



560
MAFTAAVTSPLTTSQ
ATG GCT TTT ACA GCT GCT GTC ACC AGC CCA CTA ACC ACT AGC CAA

570 580
TLLFNILGGWVAAQL
ACC CTC CTC TTC AAC ATA TTG GGG GGG TGG GTG GCT GCC CAG CTC

590

AAPGAATAFVGAGLA
GCC GCC CCC GGT GCC GCT ACT GCC TTT GTG GGC GCT GGC TTA GCT

600 610
GAAIGSVGLGKVLID
GGC GCC GCC ATC GGC AGT GTT GGA CTG GGG AAG GTC CTC ATA GAC

620
ILAGYGAGVAGALVA
ATC CTT GCA GGG TAT GGC GCG GGC GTG GCG GGA GCT CTT GTG GCA

630 640
FKIMSGEVPSTEDLV
TTC AAG ATC ATG AGC GGT GAG GTC CCC TCC ACG GAG GAC CTG GTC

650
NLLPAILSPGALVVG
AAT CTA CTG CCC GCC ATC CTC TCG CCC GGA GCC CTC GTA GTC GGC

660 670
VVCAAILRRHVGPGE
GTG GTC TGT GCA GCA ATA CTG CGC CGG CAC GTT GGC CCG GGC GAG

680
GAVQWMNRLIAFASR
GGG GCA GTG CAG TGG ATG AAC CGG CTG ATA GCC TTC GCC TCC CGG

690 700
GNHVSPTHYVPESDA
GGG AAC CAT GTT TCC CCC ACG CAC TAC GTG CCG GAG AGC GAT GCA

710
AARVTAILSSLTVTQ
GCT GCC CGC GTC ACT GCC ATA CTC AGC AGC CTC ACT GTA ACC CAG

720 730
LLRRLHQWISSECTT
CTC CTG AGG CGA CTG CAC CAG TGG ATA AGC TCG GAG TGT ACC ACT

740
PCSGSWLRDIWDWIC
CCA TGC TCC GGT TCC TGG CTA AGG GAC ATC TGG GAC TGG ATA TGC

750 760
EVLSDFKTWLKAKLM
GAG GTG TTG AGC GAC TTT AAG ACC TGG CTA AAA GCT AAG CTC ATG

770
PQLPGIPFVSCQRGY
CCA CAG CTG CCT GGG ATC CCC TTT GTG TCC TGC CAG CGC GGG TAT

780 790
KGVWRGDGIMHTRCH
AAG GGG GTC TGG CGA GGG GAC GGC ATC ATG CAC ACT CGC TGC CAC

800
CGAEITGHVKNGTMR
TGT GGA GCT GAG ATC ACT GGA CAT GTC AAA AAC GGG ACG ATG AGG

810 820
IVGPRTCRNMWSGTF
ATC GTC GGT CCT AGG ACC TGC AGG AAC ATG TGG AGT GGG ACC TTC

830
PINAYTTGPCTPLPA
CCC ATT AAT GCC TAC ACC ACG GGC CCC TGT ACC CCC CTT CCT GCG

840 850
PNYTFALWRVSAEEY
CCG AAC TAC ACG TTC GCG CTA TGG AGG GTG TCT GCA GAG GAA TAC

860
VEIRQVGDFHYVTGM
GTG GAG ATA AGG CAG GTG GGG GAC TTC CAC TAC GTG ACG GGT ATG

870 880
TTDNLKCPCQVPSPE
ACT ACT GAC AAT CTT AAA TGC CCG TGC CAG GTC CCA TCG CCC GAA

890
FFTELDGVRLHRFAP
TTT TTC ACA GAA TTG GAC GGG GTG CGC CTA CAT AGG TTT GCG CCC

900 910
PCKPLLREEVSFRVG
CCC TGC AAG CCC TTG CTG CGC GAG GAG GTA TCA TTC AGA GTA GGA

920
LHEYPVGSQLPCEPE
CTC CAC GAA TAC CCG GTA GGG TCG CAA TTA CCT TGC GAG CCC GAA

930 940
PDVAVLTSMLTDPSH
CCG GAC GTG GCC GTG TTG ACG TCC ATG CTC ACT GAT CCC TCC CAT

950
ITAEAAGRRLARGSP
ATA ACA GCA GAG GCG GCC GGG CGA AGG TTG GCG AGG GGA TCA CCC

960 970
PSVASSSASQLSAPS
CCC TCT GTG GCC AGC TCC TCG GCT AGC CAG CTA TCC GCT CCA TCT

980

LKATCTANHDSPDAE
CTC AAG GCA ACT TGC ACC GCT AAC CAT GAC TCC CCT GAT GCT GAG

990 1000
LIEANLLWRQEMGGN
CTC ATA GAG GCC AAC CTC CTA TGG AGG CAG GAG ATG GGC GGC AAC

1010
ITRVESENKVVILDS
ATC ACC AGG GTT GAG TCA GAA AAC AAA GTG GTG ATT CTG GAC TCC

1020 1030
FDPLVAEEDEREISV
TTC GAT CCG CTT GTG GCG GAG GAG GAC GAG CGG GAG ATC TCC GTA

1040
PAEILRKSRRFAQAL
CCC GCA GAA ATC CTG CGG AAG TCT CGG AGA TTC GCC CAG GCC CTG

1050 1060
PVWARPDYNPPLVET
CCC GTT TGG GCG CGG CCG GAC TAT AAC CCC CCG CTA GTG GAG ACG

1070
WKKPDYEPPVVHGCP
TGG AAA AAG CCC GAC TAC GAA CCA CCT GTG GTC CAT GGC TGC CCG

1080 1090
LPPPKSPPVPPPRKK
CTT CCA CCT CCA AAG TCC CCT CCT GTG CCT CCG CCT CGG AAG AAG

1100
RTVVLTESTLSTALA
CGG ACG GTG GTC CTC ACT GAA TCA ACC CTA TCT ACT GCC TTG GCC

1110 1120
ELATRSFGSSSTSGI
GAG CTC GCC ACC AGA AGC TTT GGC AGC TCC TCA ACT TCC GGC ATT

1130
TGDNTTTSSEPAPSG
ACG GGC GAC AAT ACG ACA ACA TCC TCT GAG CCC GCC CCT TCT GGC

1140 1150
CPPDSDAESYSSMPP
TGC CCC CCC GAC TCC GAC GCT GAG TCC TAT TCC TCC ATG CCC CCC

1160
LEGEPGDPDLSDGSW
CTG GAG GGG GAG CCT GGG GAT CCG GAT CTT AGC GAC GGG TCA TGG

1170 1180
STVSSEANAEDVVCC
TCA ACG GTC AGT AGT GAG GCC AAC GCG GAG GAT GTC GTG TGC TGC

1190
SMSYSWTGALVTPCA
TCA ATG TCT TAC TCT TGG ACA GGC GCA CTC GTC ACC CCG TGC GCC

1200 1210
AEEQKLPINALSNSL
GCG GAA GAA CAG AAA CTG CCC ATC AAT GCA CTA AGC AAC TCG TTG

1220
LRHHNLVYSTTSRSA
CTA CGT CAC CAC AAT TTG GTG TAT TCC ACC ACC TCA CGC AGT GCT

1230 1240
CQRQKKVTFDRLQVL
TGC CAA AGG CAG AAG AAA GTC ACA TTT GAC AGA CTG CAA GTT CTG

1250
DSHYQDVLKEVKAAA
GAC AGC CAT TAC CAG GAC GTA CTC AAG GAG GTT AAA GCA GCG GCG

1260 1270
SKVKANLLSVEEACS
TCA AAA GTG AAG GCT AAC TTG CTA TCC GTA GAG GAA GCT TGC AGC

1280
LTPPHSAKSKFGYGA
CTG ACG CCC CCA CAC TCA GCC AAA TCC AAG TTT GGT TAT GGG GCA

1290 1300
KDVRCHARKAVTHIN
AAA GAC GTC CGT TGC CAT GCC AGA AAG GCC GTA ACC CAC ATC AAC

1310
SVWKDLLEDNVTPID
TCC GTG TGG AAA GAC CTT CTG GAA GAC AAT GTA ACA CCA ATA GAC

1320 1330
TTIMAKNEVFCVQPE
ACT ACC ATC ATG GCT AAG AAC GAG GTT TTC TGC GTT CAG CCT GAG

1340
KGGRKPARLIVFPDL
AAG GGG GGT CGT AAG CCA GCT CGT CTC ATC GTG TTC CCC GAT CTG

1350 1360
GVRVCEKMALYDVVT
GGC GTG CGC GTG TGC GAA AAG ATG GCT TTG TAC GAC GTG GTT ACA

1370

KLPLAVMGSSYGFQY
AAG CTC CCC TTG GCC GTG ATG GGA AGC TCC TAC GGA TTC CAA TAC

1380 1390
SPGQRVEFLVQAWKS
TCA CCA GGA CAG CGG GTT GAA TTC CTC GTG CAA GCG TGG AAG TCC

1400
KKTPMGFSYDTRCFD
AAG AAA ACC CCA ATG GGG TTC TCG TAT GAT ACC CGC TGC TTT GAC

1410 1420
STVTESDIRTEEAIY
TCC ACA GTC ACT GAG AGC GAC ATC CGT ACG GAG GAG GCA ATC TAC

1430
QCCDLDPQARVAIKS
CAA TGT TGT GAC CTC GAC CCC CAA GCC CGC GTG GCC ATC AAG TCC

1440 1450
LTERLYVGGPLTNSR
CTC ACC GAG AGG CTT TAT GTT GGG GGC CCT CTT ACC AAT TCA AGG

1460
GENCGYRRCRASGVL
GGG GAG AAC TGC GGC TAT CGC AGG TGC CGC GCG AGC GGC GTA CTG

1470 1480
TTSCGNTLTCYIKAR
ACA ACT AGC TGT GGT AAC ACC CTC ACT TGC TAC ATC AAG GCC CGG

1490
AACRAAGLQDCTMLV
GCA GCC TGT CGA GCC GCA GGG CTC CAG GAC TGC ACC ATG CTC GTG

1500 1510
CGDDLVVICESAGVQ
TGT GGC GAC GAC TTA GTC GTT ATC TGT GAA AGC GCG GGG GTC CAG

1520
EDAASLRAFTEAMTR
GAG GAC GCG GCG AGC CTG AGA GCC TTC ACG GAG GCT ATG ACC AGG

1530 1540
YSAPPGDPPQPEYDL
TAC TCC GCC CCC CCT GGG GAC CCC CCA CAA CCA GAA TAC GAC TTG

1550
ELITSCSSNVSVAHD
GAG CTC ATA ACA TCA TGC TCC TCC AAC GTG TCA GTC GCC CAC GAC

1560 1570

GAGKRVYYLTRDPTT
GGC GCT GGA AAG AGG GTC TAC TAC CTC ACC CGT GAC CCT ACA ACC

1580
PLARAAWETARHTPV
CCC CTC GCG AGA GCT GCG TGG GAG ACA GCA AGA CAC ACT CCA GTC

1590 1600
NSWLGNIIMFAPTLW
AAT TCC TGG CTA GGC AAC ATA ATC ATG TTT GCC CCC ACA CTG TGG

1610
ARMILMTHFFSVLIA
GCG AGG ATG ATA CTG ATG ACC CAT TTC TTT AGC GTC CTT ATA GCC

1620 1630
RDQLEQALDCEIYGA
AGG GAC CAG CTT GAA CAG GCC CTC GAT TGC GAG ATC TAC GGG GCC

1640
CYSIEPLDLPPIIQR
TGC TAC TCC ATA GAA CCA CTG GAT CTA CCT CCA ATC ATT CAA AGA

1650 1660
LHGLSAFSLHSYSPG
CTC CAT GGC CTC AGC GCA TTT TCA CTC CAC AGT TAC TCT CCA GGT

1670
EINRVAACLRKLGVP
GAA ATC AAT AGG GTG GCC GCA TGC CTC AGA AAA CTT GGG GTA CCG

1680 1690
PLRAWRHRARSVRAR
CCC TTG CGA GCT TGG AGA CAC CGG GCC CGG AGC GTC CGC GCT AGG

1700
LLARGGRAAICGKYL
CTT CTG GCC AGA GGA GGC AGG GCT GCC ATA TGT GGC AAG TAC CTC

1710 1720
FNWAVRTKLKLTPIA
TTC AAC TGG GCA GTA AGA ACA AAG CTC AAA CTC ACT CCA ATA GCG

1730
AAGQLDLSGWFTAGY
GCC GCT GGC CAG CTG GAC TTG TCC GGC TGG TTC ACG GCT GGC TAC

1740 1750
SGGDIYHSVSHARPR
AGC GGG GGA GAC ATT TAT CAC AGC GTG TCT CAT GCC CGG CCC CGC

1760

WIWFCLLLLAAGVGI
TGG ATC TGG TTT TGC CTA CTC CTG CTT GCT GCA GGG GTA GGC ATC

1770 1780
YLLPNRMSTNPKPQR
TAC CTC CTC CCC AAC CGA ATG AGC ACG AAT CCT AAA CCT CAA AGA

1790
KTKRNTNRRPQDVKF
AAG ACC AAA CGT AAC ACC AAC CGG CGG CCG CAG GAC GTC AAG TTC

1800 1810
PGGGQIVGGVYLLPR
CCG GGT GGC GGT CAG ATC GTT GGT GGA GTT TAC TTG TTG CCG CGC

1820
RGPRLGVRATRKTSE
AGG GGC CCT AGA TTG GGT GTG CGC GCG ACG AGA AAG ACT TCC GAG

1830 1840
RSQPRGRRQPIPKAR
CGG TCG CAA CCT CGA GGT AGA CGT CAG CCT ATC CCC AAG GCT CGT

1850
RPEGRTWAQPGYPWP
CGG CCC GAG GGC AGG ACC TGG GCT CAG CCC GGG TAC CCT TGG CCC

1860 1870
LYGNEGCGWAGWLLS
CTC TAT GGC AAT GAG GGC TGC GGG TGG GCG GGA TGG CTC CTG TCT

1880
PRGSRPSWGPTDPRR
CCC CGT GGC TCT CGG CCT AGC TGG GGC CCC ACA GAC CCC CGG CGT

1890 1892
RSRNLGK
AGG TCG CGC AAT TTG GGT AAG

FIG. 5J



WO 2004/039950 PCT/US2003/033610

[Drawing sheets 17/21 to 21/21: graphs (axis and legend text illegible).]



SEQUENCE LISTING 

<110> HOUGHTON, Michael
COATES, Steve
SELBY, Mark
PALIARD, Xavier

<120> ACTIVATION OF HCV-SPECIFIC T-CELLS

<130> 2300-1612.60

<150> 10/281,341
<151> 2002-10-25

<160> 10

<170> PatentIn Ver. 2.0

<210> 1 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: fusion protein 
epitope 

<400> 1 

His Glu Tyr Pro Val Gly Ser Gln Leu
1 5



<210> 2 
<211> 15 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: fusion protein 
epitope 

<400> 2 

Ala Glu Leu Ile Glu Ala Asn Leu Leu Trp Arg Gln Glu Met Gly
1 5 10 15



<210> 3 
<211> 1914 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HCV-1 E1/E2/p7 region

<220>
<221> CDS
<222> (1)..(1911)

<400> 3 

tct ttc tct atc ttc ctt ctg gcc ctg ctc tct tgc ttg act gtg ccc 48
Ser Phe Ser Ile Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro
1 5 10 15

gct tcg gcc tac caa gtg cgc aac tcc acg ggg ctc tac cac gtc acc 96
Ala Ser Ala Tyr Gln Val Arg Asn Ser Thr Gly Leu Tyr His Val Thr
20 25 30

aat gat tgc cct aac tcg agt att gtg tac gag gcg gcc gat gcc atc 144
Asn Asp Cys Pro Asn Ser Ser Ile Val Tyr Glu Ala Ala Asp Ala Ile
35 40 45

ctg cac act ccg ggg tgc gtc cct tgc gtt cgc gag ggc aac gcc tcg 192
Leu His Thr Pro Gly Cys Val Pro Cys Val Arg Glu Gly Asn Ala Ser
50 55 60

agg tgt tgg gtg gcg atg acc cct acg gtg gcc acc agg gat ggc aaa 240
Arg Cys Trp Val Ala Met Thr Pro Thr Val Ala Thr Arg Asp Gly Lys
65 70 75 80

ctc ccc gcg acg cag ctt cga cgt cac atc gat ctg ctt gtc ggg agc 288
Leu Pro Ala Thr Gln Leu Arg Arg His Ile Asp Leu Leu Val Gly Ser
85 90 95

gcc acc ctc tgt tcg gcc ctc tac gtg ggg gac ctg tgc ggg tct gtc 336
Ala Thr Leu Cys Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Ser Val
100 105 110

ttt ctt gtc ggc caa ctg ttt acc ttc tct ccc agg cgc cac tgg acg 384
Phe Leu Val Gly Gln Leu Phe Thr Phe Ser Pro Arg Arg His Trp Thr
115 120 125

acg caa ggt tgc aat tgc tct atc tat ccc ggc cat ata acg ggt cac 432
Thr Gln Gly Cys Asn Cys Ser Ile Tyr Pro Gly His Ile Thr Gly His
130 135 140

cgc atg gca tgg gat atg atg atg aac tgg tcc cct acg acg gcg ttg 480
Arg Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu
145 150 155 160

gta atg gct cag ctg ctc cgg atc cca caa gcc atc ttg gac atg atc 528
Val Met Ala Gln Leu Leu Arg Ile Pro Gln Ala Ile Leu Asp Met Ile
165 170 175

gct ggt gct cac tgg gga gtc ctg gcg ggc ata gcg tat ttc tcc atg 576
Ala Gly Ala His Trp Gly Val Leu Ala Gly Ile Ala Tyr Phe Ser Met
180 185 190

gtg ggg aac tgg gcg aag gtc ctg gta gtg ctg ctg cta ttt gcc ggc 624
Val Gly Asn Trp Ala Lys Val Leu Val Val Leu Leu Leu Phe Ala Gly
195 200 205

gtc gac gcg gaa acc cac gtc acc ggg gga agt gcc ggc cac act gtg 672
Val Asp Ala Glu Thr His Val Thr Gly Gly Ser Ala Gly His Thr Val
210 215 220

tct gga ttt gtt agc ctc ctc gca cca ggc gcc aag cag aac gtc cag 720
Ser Gly Phe Val Ser Leu Leu Ala Pro Gly Ala Lys Gln Asn Val Gln
225 230 235 240

ctg atc aac acc aac ggc agt tgg cac ctc aat agc acg gcc ctg aac 768
Leu Ile Asn Thr Asn Gly Ser Trp His Leu Asn Ser Thr Ala Leu Asn
245 250 255

tgc aat gat agc ctc aac acc ggc tgg ttg gca ggg ctt ttc tat cac 816
Cys Asn Asp Ser Leu Asn Thr Gly Trp Leu Ala Gly Leu Phe Tyr His
260 265 270

cac aag ttc aac tct tca ggc tgt cct gag agg cta gcc agc tgc cga 864
His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg
275 280 285

ccc ctt acc gat ttt gac cag ggc tgg ggc cct atc agt tat gcc aac 912
Pro Leu Thr Asp Phe Asp Gln Gly Trp Gly Pro Ile Ser Tyr Ala Asn
290 295 300

gga agc ggc ccc gac cag cgc ccc tac tgc tgg cac tac ccc cca aaa 960
Gly Ser Gly Pro Asp Gln Arg Pro Tyr Cys Trp His Tyr Pro Pro Lys
305 310 315 320

cct tgc ggt att gtg ccc gcg aag agt gtg tgt ggt ccg gta tat tgc 1008
Pro Cys Gly Ile Val Pro Ala Lys Ser Val Cys Gly Pro Val Tyr Cys
325 330 335

ttc act ccc agc ccc gtg gtg gtg gga acg acc gac agg tcg ggc gcg 1056
Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Ser Gly Ala
340 345 350

ccc acc tac agc tgg ggt gaa aat gat acg gac gtc ttc gtc ctt aac 1104
Pro Thr Tyr Ser Trp Gly Glu Asn Asp Thr Asp Val Phe Val Leu Asn
355 360 365

aat acc agg cca ccg ctg ggc aat tgg ttc ggt tgt acc tgg atg aac 1152
Asn Thr Arg Pro Pro Leu Gly Asn Trp Phe Gly Cys Thr Trp Met Asn
370 375 380

tca act gga ttc acc aaa gtg tgc gga gcg cct cct tgt gtc atc gga 1200
Ser Thr Gly Phe Thr Lys Val Cys Gly Ala Pro Pro Cys Val Ile Gly
385 390 395 400

ggg gcg ggc aac aac acc ctg cac tgc ccc act gat tgc ttc cgc aag 1248
Gly Ala Gly Asn Asn Thr Leu His Cys Pro Thr Asp Cys Phe Arg Lys
405 410 415

cat ccg gac gcc aca tac tct cgg tgc ggc tcc ggt ccc tgg atc aca 1296
His Pro Asp Ala Thr Tyr Ser Arg Cys Gly Ser Gly Pro Trp Ile Thr
420 425 430

ccc agg tgc ctg gtc gac tac ccg tat agg ctt tgg cat tat cct tgt 1344
Pro Arg Cys Leu Val Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys
435 440 445

acc atc aac tac act ata ttt aaa atc agg atg tac gtg gga ggg gtc 1392
Thr Ile Asn Tyr Thr Ile Phe Lys Ile Arg Met Tyr Val Gly Gly Val
450 455 460

gag cac agg ctg gaa gct gcc tgc aac tgg acg cgg ggc gaa cgt tgc 1440
Glu His Arg Leu Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys
465 470 475 480

gat ctg gaa gat agg gac agg tcc gag ctc agc ccg tta ctg ctg acc 1488
Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Thr
485 490 495

act aca cag tgg cag gtc ctc ccg tgt tcc ttc aca acc ctg cca gcc 1536
Thr Thr Gln Trp Gln Val Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala
500 505 510

ttg tcc acc ggc ctc atc cac ctc cac cag aac att gtg gac gtg cag 1584
Leu Ser Thr Gly Leu Ile His Leu His Gln Asn Ile Val Asp Val Gln
515 520 525

tac ttg tac ggg gtg ggg tca agc atc gcg tcc tgg gcc att aag tgg 1632
Tyr Leu Tyr Gly Val Gly Ser Ser Ile Ala Ser Trp Ala Ile Lys Trp
530 535 540

gag tac gtc gtc ctc ctg ttc ctt ctg ctt gca gac gcg cgc gtc tgc 1680
Glu Tyr Val Val Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys
545 550 555 560

tcc tgc ttg tgg atg atg cta ctc ata tcc caa gcg gaa gcg gct ttg 1728
Ser Cys Leu Trp Met Met Leu Leu Ile Ser Gln Ala Glu Ala Ala Leu
565 570 575

gag aac ctc gta ata ctt aat gca gca tcc ctg gcc ggg acg cac ggt 1776
Glu Asn Leu Val Ile Leu Asn Ala Ala Ser Leu Ala Gly Thr His Gly
580 585 590

ctt gta tcc ttc ctc gtg ttc ttc tgc ttt gca tgg tat ctg aag ggt 1824
Leu Val Ser Phe Leu Val Phe Phe Cys Phe Ala Trp Tyr Leu Lys Gly
595 600 605

aag tgg gtg ccc gga gcg gtc tac acc ttc tac ggg atg tgg cct ctc 1872
Lys Trp Val Pro Gly Ala Val Tyr Thr Phe Tyr Gly Met Trp Pro Leu
610 615 620

ctc ctg ctc ctg ttg gcg ttg ccc cag cgg gcg tac gcg taa 1914
Leu Leu Leu Leu Leu Ala Leu Pro Gln Arg Ala Tyr Ala
625 630 635



<210> 4 
<211> 637 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: HCV-1 E1/E2/p7 region
<400> 4 

Ser Phe Ser Ile Phe Leu Leu Ala Leu Leu Ser Cys Leu Thr Val Pro
1 5 10 15

Ala Ser Ala Tyr Gln Val Arg Asn Ser Thr Gly Leu Tyr His Val Thr
20 25 30

Asn Asp Cys Pro Asn Ser Ser Ile Val Tyr Glu Ala Ala Asp Ala Ile
35 40 45

Leu His Thr Pro Gly Cys Val Pro Cys Val Arg Glu Gly Asn Ala Ser
50 55 60

Arg Cys Trp Val Ala Met Thr Pro Thr Val Ala Thr Arg Asp Gly Lys
65 70 75 80

Leu Pro Ala Thr Gln Leu Arg Arg His Ile Asp Leu Leu Val Gly Ser
85 90 95

Ala Thr Leu Cys Ser Ala Leu Tyr Val Gly Asp Leu Cys Gly Ser Val
100 105 110

Phe Leu Val Gly Gln Leu Phe Thr Phe Ser Pro Arg Arg His Trp Thr
115 120 125

Thr Gln Gly Cys Asn Cys Ser Ile Tyr Pro Gly His Ile Thr Gly His
130 135 140

Arg Met Ala Trp Asp Met Met Met Asn Trp Ser Pro Thr Thr Ala Leu
145 150 155 160

Val Met Ala Gln Leu Leu Arg Ile Pro Gln Ala Ile Leu Asp Met Ile
165 170 175

Ala Gly Ala His Trp Gly Val Leu Ala Gly Ile Ala Tyr Phe Ser Met
180 185 190

Val Gly Asn Trp Ala Lys Val Leu Val Val Leu Leu Leu Phe Ala Gly
195 200 205

Val Asp Ala Glu Thr His Val Thr Gly Gly Ser Ala Gly His Thr Val
210 215 220

Ser Gly Phe Val Ser Leu Leu Ala Pro Gly Ala Lys Gln Asn Val Gln
225 230 235 240

Leu Ile Asn Thr Asn Gly Ser Trp His Leu Asn Ser Thr Ala Leu Asn
245 250 255

Cys Asn Asp Ser Leu Asn Thr Gly Trp Leu Ala Gly Leu Phe Tyr His
260 265 270

His Lys Phe Asn Ser Ser Gly Cys Pro Glu Arg Leu Ala Ser Cys Arg
275 280 285

Pro Leu Thr Asp Phe Asp Gln Gly Trp Gly Pro Ile Ser Tyr Ala Asn
290 295 300

Gly Ser Gly Pro Asp Gln Arg Pro Tyr Cys Trp His Tyr Pro Pro Lys
305 310 315 320

Pro Cys Gly Ile Val Pro Ala Lys Ser Val Cys Gly Pro Val Tyr Cys
325 330 335

Phe Thr Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Ser Gly Ala
340 345 350

Pro Thr Tyr Ser Trp Gly Glu Asn Asp Thr Asp Val Phe Val Leu Asn
355 360 365

Asn Thr Arg Pro Pro Leu Gly Asn Trp Phe Gly Cys Thr Trp Met Asn
370 375 380

Ser Thr Gly Phe Thr Lys Val Cys Gly Ala Pro Pro Cys Val Ile Gly
385 390 395 400

Gly Ala Gly Asn Asn Thr Leu His Cys Pro Thr Asp Cys Phe Arg Lys
405 410 415

His Pro Asp Ala Thr Tyr Ser Arg Cys Gly Ser Gly Pro Trp Ile Thr
420 425 430

Pro Arg Cys Leu Val Asp Tyr Pro Tyr Arg Leu Trp His Tyr Pro Cys
435 440 445

Thr Ile Asn Tyr Thr Ile Phe Lys Ile Arg Met Tyr Val Gly Gly Val
450 455 460

Glu His Arg Leu Glu Ala Ala Cys Asn Trp Thr Arg Gly Glu Arg Cys
465 470 475 480

Asp Leu Glu Asp Arg Asp Arg Ser Glu Leu Ser Pro Leu Leu Leu Thr
485 490 495

Thr Thr Gln Trp Gln Val Leu Pro Cys Ser Phe Thr Thr Leu Pro Ala
500 505 510

Leu Ser Thr Gly Leu Ile His Leu His Gln Asn Ile Val Asp Val Gln
515 520 525

Tyr Leu Tyr Gly Val Gly Ser Ser Ile Ala Ser Trp Ala Ile Lys Trp
530 535 540

Glu Tyr Val Val Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg Val Cys
545 550 555 560

Ser Cys Leu Trp Met Met Leu Leu Ile Ser Gln Ala Glu Ala Ala Leu
565 570 575

Glu Asn Leu Val Ile Leu Asn Ala Ala Ser Leu Ala Gly Thr His Gly
580 585 590

Leu Val Ser Phe Leu Val Phe Phe Cys Phe Ala Trp Tyr Leu Lys Gly
595 600 605

Lys Trp Val Pro Gly Ala Val Tyr Thr Phe Tyr Gly Met Trp Pro Leu
610 615 620

Leu Leu Leu Leu Leu Ala Leu Pro Gln Arg Ala Tyr Ala
625 630 635
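
The sequence figures use one-letter amino acid codes, whereas the Sequence Listing uses three-letter codes, so comparing the two is easier after converting one notation into the other. The following is a minimal sketch of that conversion; the helper name `three_to_one` is hypothetical and chosen only for this example.

```python
# Standard three-letter to one-letter amino acid abbreviations.
THREE_TO_ONE = {
    "Ala": "A", "Arg": "R", "Asn": "N", "Asp": "D", "Cys": "C",
    "Gln": "Q", "Glu": "E", "Gly": "G", "His": "H", "Ile": "I",
    "Leu": "L", "Lys": "K", "Met": "M", "Phe": "F", "Pro": "P",
    "Ser": "S", "Thr": "T", "Trp": "W", "Tyr": "Y", "Val": "V",
}


def three_to_one(residues: str) -> str:
    """Convert a whitespace-separated three-letter sequence to one-letter code."""
    return "".join(THREE_TO_ONE[r] for r in residues.split())


# Final row of SEQ ID NO:4 (residues 625-637).
tail = "Leu Leu Leu Leu Leu Ala Leu Pro Gln Arg Ala Tyr Ala"

print(three_to_one(tail))       # LLLLLALPQRAYA
print(624 + len(tail.split()))  # 637, matching <211> 637
```

A lookup that raises `KeyError` on an unknown token is deliberate here: it flags misread residue names instead of silently skipping them.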



<210> 5 
<211> 21 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: consensus sequence 



<400> 5

Gly Ser Ala Ala Arg Thr Thr Ser Gly Phe Val Ser Leu Phe Ala Pro
1 5 10 15

Gly Ala Lys Gln Asn
20



<210> 6 
<211> 20
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: exemplary CpG 
oligonucleotide 

<400> 6 

tccatgacgt tcctgacgtt 
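
SEQ ID NO:6 is described as an exemplary CpG oligonucleotide. The short sketch below locates each `cg` dinucleotide in the listed 20-mer and confirms its length; the helper name `cpg_positions` is an assumption of this example, not a term used in the application.

```python
def cpg_positions(oligo: str) -> list:
    """Return the 1-based position of the 'c' in every 'cg' dinucleotide."""
    seq = "".join(oligo.lower().split())
    return [i + 1 for i in range(len(seq) - 1) if seq[i:i + 2] == "cg"]


seq_id_6 = "tccatgacgt tcctgacgtt"  # as printed under <400> 6

print(len("".join(seq_id_6.split())))  # 20, matching <211> 20
print(cpg_positions(seq_id_6))         # [8, 17]
```

Both hits sit in the same `gacgtt` context, at nucleotides 8-9 and 17-18.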



<210> 7 
<211> 5676 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: representative 
NS345Core fusion protein 

<220> 

<221> CDS 

<222> (1)..(5676)

<400> 7 

atg gct gca tat gca gct cag ggc tat aag gtg cta gta ctc aac ccc 48
Met Ala Ala Tyr Ala Ala Gln Gly Tyr Lys Val Leu Val Leu Asn Pro
1 5 10 15

tct gtt gct gca aca ctg ggc ttt ggt gct tac atg tcc aag gct cat 96
Ser Val Ala Ala Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys Ala His
20 25 30

ggg atc gat cct aac atc agg acc ggg gtg aga aca att acc act ggc 144
Gly Ile Asp Pro Asn Ile Arg Thr Gly Val Arg Thr Ile Thr Thr Gly
35 40 45

agc ccc atc acg tac tcc acc tac ggc aag ttc ctt gcc gac ggc ggg 192
Ser Pro Ile Thr Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly
50 55 60

tgc tcg ggg ggc gct tat gac ata ata att tgt gac gag tgc cac tcc 240
Cys Ser Gly Gly Ala Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ser
65 70 75 80

acg gat gcc aca tcc atc ttg ggc att ggc act gtc ctt gac caa gca 288
Thr Asp Ala Thr Ser Ile Leu Gly Ile Gly Thr Val Leu Asp Gln Ala
85 90 95

gag act gcg ggg gcg aga ctg gtt gtg ctc gcc acc gcc acc cct ccg 336
Glu Thr Ala Gly Ala Arg Leu Val Val Leu Ala Thr Ala Thr Pro Pro
100 105 110

ggc tcc gtc act gtg ccc cat ccc aac atc gag gag gtt gct ctg tcc 384
Gly Ser Val Thr Val Pro His Pro Asn Ile Glu Glu Val Ala Leu Ser
115 120 125

acc acc gga gag atc cct ttt tac ggc aag gct atc ccc ctc gaa gta 432
Thr Thr Gly Glu Ile Pro Phe Tyr Gly Lys Ala Ile Pro Leu Glu Val
130 135 140

atc aag ggg ggg aga cat ctc atc ttc tgt cat tca aag aag aag tgc 480
Ile Lys Gly Gly Arg His Leu Ile Phe Cys His Ser Lys Lys Lys Cys
145 150 155 160

gac gaa ctc gcc gca aag ctg gtc gca ttg ggc atc aat gcc gtg gcc 528
Asp Glu Leu Ala Ala Lys Leu Val Ala Leu Gly Ile Asn Ala Val Ala
165 170 175

tac tac cgc ggt ctt gac gtg tcc gtc atc ccg acc agc ggc gat gtt 576
Tyr Tyr Arg Gly Leu Asp Val Ser Val Ile Pro Thr Ser Gly Asp Val
180 185 190

gtc gtc gtg gca acc gat gcc ctc atg acc ggc tat acc ggc gac ttc 624
Val Val Val Ala Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe
195 200 205

gac tcg gtg ata gac tgc aat acg tgt gtc acc cag aca gtc gat ttc 672
Asp Ser Val Ile Asp Cys Asn Thr Cys Val Thr Gln Thr Val Asp Phe
210 215 220

agc ctt gac cct acc ttc acc att gag aca atc acg ctc ccc caa gat 720
Ser Leu Asp Pro Thr Phe Thr Ile Glu Thr Ile Thr Leu Pro Gln Asp
225 230 235 240

gct gtc tcc cgc act caa cgt cgg ggc agg act ggc agg ggg aag cca 768
Ala Val Ser Arg Thr Gln Arg Arg Gly Arg Thr Gly Arg Gly Lys Pro
245 250 255

ggc atc tac aga ttt gtg gca ccg ggg gag cgc ccc tcc ggc atg ttc 816
Gly Ile Tyr Arg Phe Val Ala Pro Gly Glu Arg Pro Ser Gly Met Phe
260 265 270

gac tcg tcc gtc ctc tgt gag tgc tat gac gca ggc tgt gct tgg tat 864
Asp Ser Ser Val Leu Cys Glu Cys Tyr Asp Ala Gly Cys Ala Trp Tyr
275 280 285

gag ctc acg ccc gcc gag act aca gtt agg cta cga gcg tac atg aac 912
Glu Leu Thr Pro Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Met Asn
290 295 300

acc ccg ggg ctt ccc gtg tgc cag gac cat ctt gaa ttt tgg gag ggc 960
Thr Pro Gly Leu Pro Val Cys Gln Asp His Leu Glu Phe Trp Glu Gly
305 310 315 320

gtc ttt aca ggc ctc act cat ata gat gcc cac ttt cta tcc cag aca 1008
Val Phe Thr Gly Leu Thr His Ile Asp Ala His Phe Leu Ser Gln Thr
325 330 335

aag cag agt ggg gag aac ctt cct tac ctg gta gcg tac caa gcc acc 1056
Lys Gln Ser Gly Glu Asn Leu Pro Tyr Leu Val Ala Tyr Gln Ala Thr
340 345 350

gtg tgc gct agg gct caa gcc cct ccc cca tcg tgg gac cag atg tgg 1104
Val Cys Ala Arg Ala Gln Ala Pro Pro Pro Ser Trp Asp Gln Met Trp
355 360 365

aag tgt ttg att cgc ctc aag ccc acc ctc cat ggg cca aca ccc ctg 1152
Lys Cys Leu Ile Arg Leu Lys Pro Thr Leu His Gly Pro Thr Pro Leu
370 375 380

cta tac aga ctg ggc gct gtt cag aat gaa atc acc ctg acg cac cca 1200
Leu Tyr Arg Leu Gly Ala Val Gln Asn Glu Ile Thr Leu Thr His Pro
385 390 395 400

gtc acc aaa tac atc atg aca tgc atg tcg gcc gac ctg gag gtc gtc 1248
Val Thr Lys Tyr Ile Met Thr Cys Met Ser Ala Asp Leu Glu Val Val
405 410 415

acg agc acc tgg gtg ctc gtt ggc ggc gtc ctg gct gct ttg gcc gcg 1296
Thr Ser Thr Trp Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala
420 425 430

tat tgc ctg tca aca ggc tgc gtg gtc ata gtg ggc agg gtc gtc ttg 1344
Tyr Cys Leu Ser Thr Gly Cys Val Val Ile Val Gly Arg Val Val Leu
435 440 445

tcc ggg aag ccg gca atc ata cct gac agg gaa gtc ctc tac cga gag 1392
Ser Gly Lys Pro Ala Ile Ile Pro Asp Arg Glu Val Leu Tyr Arg Glu
450 455 460

ttc gat gag atg gaa gag tgc tct cag cac tta ccg tac atc gag caa 1440
Phe Asp Glu Met Glu Glu Cys Ser Gln His Leu Pro Tyr Ile Glu Gln
465 470 475 480

ggg atg atg ctc gcc gag cag ttc aag cag aag gcc ctc ggc ctc ctg 1488
Gly Met Met Leu Ala Glu Gln Phe Lys Gln Lys Ala Leu Gly Leu Leu
485 490 495

cag acc gcg tcc cgt cag gca gag gtt atc gcc cct gct gtc cag acc 1536
Gln Thr Ala Ser Arg Gln Ala Glu Val Ile Ala Pro Ala Val Gln Thr
500 505 510

aac tgg caa aaa ctc gag acc ttc tgg gcg aag cat atg tgg aac ttc 1584
Asn Trp Gln Lys Leu Glu Thr Phe Trp Ala Lys His Met Trp Asn Phe
515 520 525

atc agt ggg ata caa tac ttg gcg ggc ttg tca acg ctg cct ggt aac 1632
Ile Ser Gly Ile Gln Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn
530 535 540

ccc gcc att gct tca ttg atg gct ttt aca gct gct gtc acc agc cca 1680
Pro Ala Ile Ala Ser Leu Met Ala Phe Thr Ala Ala Val Thr Ser Pro
545 550 555 560

cta acc act agc caa acc ctc ctc ttc aac ata ttg ggg ggg tgg gtg 1728
Leu Thr Thr Ser Gln Thr Leu Leu Phe Asn Ile Leu Gly Gly Trp Val
565 570 575

gct gcc cag ctc gcc gcc ccc ggt gcc gct act gcc ttt gtg ggc gct 1776
Ala Ala Gln Leu Ala Ala Pro Gly Ala Ala Thr Ala Phe Val Gly Ala
580 585 590

ggc tta gct ggc gcc gcc atc ggc agt gtt gga ctg ggg aag gtc ctc 1824
Gly Leu Ala Gly Ala Ala Ile Gly Ser Val Gly Leu Gly Lys Val Leu
595 600 605

ata gac atc ctt gca ggg tat ggc gcg ggc gtg gcg gga gct ctt gtg 1872
Ile Asp Ile Leu Ala Gly Tyr Gly Ala Gly Val Ala Gly Ala Leu Val
610 615 620

gca ttc aag atc atg agc ggt gag gtc ccc tcc acg gag gac ctg gtc 1920
Ala Phe Lys Ile Met Ser Gly Glu Val Pro Ser Thr Glu Asp Leu Val
625 630 635 640

aat cta ctg ccc gcc atc ctc tcg ccc gga gcc ctc gta gtc ggc gtg 1968
Asn Leu Leu Pro Ala Ile Leu Ser Pro Gly Ala Leu Val Val Gly Val
645 650 655

gtc tgt gca gca ata ctg cgc cgg cac gtt ggc ccg ggc gag ggg gca 2016
Val Cys Ala Ala Ile Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala
660 665 670

gtg cag tgg atg aac cgg ctg ata gcc ttc gcc tcc cgg ggg aac cat 2064
Val Gln Trp Met Asn Arg Leu Ile Ala Phe Ala Ser Arg Gly Asn His
675 680 685

gtt tcc ccc acg cac tac gtg ccg gag agc gat gca gct gcc cgc gtc 2112
Val Ser Pro Thr His Tyr Val Pro Glu Ser Asp Ala Ala Ala Arg Val
690 695 700



act gcc ata ctc agc agc ctc act gta acc cag ctc ctg agg cga ctg 2160
Thr Ala Ile Leu Ser Ser Leu Thr Val Thr Gln Leu Leu Arg Arg Leu
705 710 715 720

cac cag tgg ata agc tcg gag tgt acc act cca tgc tcc ggt tcc tgg 2208
His Gln Trp Ile Ser Ser Glu Cys Thr Thr Pro Cys Ser Gly Ser Trp
725 730 735

cta agg gac atc tgg gac tgg ata tgc gag gtg ttg agc gac ttt aag 2256
Leu Arg Asp Ile Trp Asp Trp Ile Cys Glu Val Leu Ser Asp Phe Lys
740 745 750

acc tgg cta aaa gct aag ctc atg cca cag ctg cct ggg atc ccc ttt 2304
Thr Trp Leu Lys Ala Lys Leu Met Pro Gln Leu Pro Gly Ile Pro Phe
755 760 765

gtg tcc tgc cag cgc ggg tat aag ggg gtc tgg cga ggg gac ggc atc 2352
Val Ser Cys Gln Arg Gly Tyr Lys Gly Val Trp Arg Gly Asp Gly Ile
770 775 780

atg cac act cgc tgc cac tgt gga gct gag atc act gga cat gtc aaa 2400
Met His Thr Arg Cys His Cys Gly Ala Glu Ile Thr Gly His Val Lys
785 790 795 800

aac ggg acg atg agg atc gtc ggt cct agg acc tgc agg aac atg tgg 2448
Asn Gly Thr Met Arg Ile Val Gly Pro Arg Thr Cys Arg Asn Met Trp
805 810 815

agt ggg acc ttc ccc att aat gcc tac acc acg ggc ccc tgt acc ccc 2496
Ser Gly Thr Phe Pro Ile Asn Ala Tyr Thr Thr Gly Pro Cys Thr Pro
820 825 830

ctt cct gcg ccg aac tac acg ttc gcg cta tgg agg gtg tct gca gag 2544
Leu Pro Ala Pro Asn Tyr Thr Phe Ala Leu Trp Arg Val Ser Ala Glu
835 840 845

gaa tac gtg gag ata agg cag gtg ggg gac ttc cac tac gtg acg ggt 2592
Glu Tyr Val Glu Ile Arg Gln Val Gly Asp Phe His Tyr Val Thr Gly
850 855 860

atg act act gac aat ctt aaa tgc ccg tgc cag gtc cca tcg ccc gaa 2640
Met Thr Thr Asp Asn Leu Lys Cys Pro Cys Gln Val Pro Ser Pro Glu
865 870 875 880

ttt ttc aca gaa ttg gac ggg gtg cgc cta cat agg ttt gcg ccc ccc 2688
Phe Phe Thr Glu Leu Asp Gly Val Arg Leu His Arg Phe Ala Pro Pro
885 890 895



tgc aag ccc ttg ctg cgg gag gag gta tca ttc aga gta gga ctc cac 2736
Cys Lys Pro Leu Leu Arg Glu Glu Val Ser Phe Arg Val Gly Leu His
900 905 910

gaa tac ccg gta ggg tcg caa tta cct tgc gag ccc gaa ccg gac gtg 2784
Glu Tyr Pro Val Gly Ser Gln Leu Pro Cys Glu Pro Glu Pro Asp Val
915 920 925

gcc gtg ttg acg tcc atg ctc act gat ccc tcc cat ata aca gca gag 2832
Ala Val Leu Thr Ser Met Leu Thr Asp Pro Ser His Ile Thr Ala Glu
930 935 940



gcg gcc ggg cga agg ttg gcg agg gga tca ccc ccc tct gtg gcc agc 2880
Ala Ala Gly Arg Arg Leu Ala Arg Gly Ser Pro Pro Ser Val Ala Ser
945 950 955 960

tcc tcg gct agc cag cta tcc gct cca tct ctc aag gca act tgc acc 2928
Ser Ser Ala Ser Gln Leu Ser Ala Pro Ser Leu Lys Ala Thr Cys Thr
965 970 975

gct aac cat gac tcc cct gat gct gag ctc ata gag gcc aac ctc cta 2976
Ala Asn His Asp Ser Pro Asp Ala Glu Leu Ile Glu Ala Asn Leu Leu
980 985 990

tgg agg cag gag atg ggc ggc aac atc acc agg gtt gag tca gaa aac 3024
Trp Arg Gln Glu Met Gly Gly Asn Ile Thr Arg Val Glu Ser Glu Asn
995 1000 1005

aaa gtg gtg att ctg gac tcc ttc gat ccg ctt gtg gcg gag gag gac 3072
Lys Val Val Ile Leu Asp Ser Phe Asp Pro Leu Val Ala Glu Glu Asp
1010 1015 1020

gag cgg gag atc tcc gta ccc gca gaa atc ctg cgg aag tct cgg aga 3120
Glu Arg Glu Ile Ser Val Pro Ala Glu Ile Leu Arg Lys Ser Arg Arg
1025 1030 1035 1040

ttc gcc cag gcc ctg ccc gtt tgg gcg cgg ccg gac tat aac ccc ccg 3168
Phe Ala Gln Ala Leu Pro Val Trp Ala Arg Pro Asp Tyr Asn Pro Pro
1045 1050 1055

cta gtg gag acg tgg aaa aag ccc gac tac gaa cca cct gtg gtc cat 3216
Leu Val Glu Thr Trp Lys Lys Pro Asp Tyr Glu Pro Pro Val Val His
1060 1065 1070

ggc tgc ccg ctt cca cct cca aag tcc cct cct gtg cct ccg cct cgg 3264
Gly Cys Pro Leu Pro Pro Pro Lys Ser Pro Pro Val Pro Pro Pro Arg
1075 1080 1085

aag aag cgg acg gtg gtc ctc act gaa tca acc cta tct act gcc ttg 3312
Lys Lys Arg Thr Val Val Leu Thr Glu Ser Thr Leu Ser Thr Ala Leu
1090 1095 1100

gcc gag ctc gcc acc aga agc ttt ggc agc tcc tca act tcc ggc att 3360
Ala Glu Leu Ala Thr Arg Ser Phe Gly Ser Ser Ser Thr Ser Gly Ile
1105 1110 1115 1120

acg ggc gac aat acg aca aca tcc tct gag ccc gcc cct tct ggc tgc 3408
Thr Gly Asp Asn Thr Thr Thr Ser Ser Glu Pro Ala Pro Ser Gly Cys
1125 1130 1135

ccc ccc gac tcc gac gct gag tcc tat tcc tcc atg ccc ccc ctg gag 3456
Pro Pro Asp Ser Asp Ala Glu Ser Tyr Ser Ser Met Pro Pro Leu Glu
1140 1145 1150

ggg gag cct ggg gat ccg gat ctt agc gac ggg tca tgg tca acg gtc 3504
Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser Trp Ser Thr Val
1155 1160 1165

agt agt gag gcc aac gcg gag gat gtc gtg tgc tgc tca atg tct tac 3552
Ser Ser Glu Ala Asn Ala Glu Asp Val Val Cys Cys Ser Met Ser Tyr
1170 1175 1180

tct tgg aca ggc gca ctc gtc acc ccg tgc gcc gcg gaa gaa cag aaa 3600
Ser Trp Thr Gly Ala Leu Val Thr Pro Cys Ala Ala Glu Glu Gln Lys
1185 1190 1195 1200

ctg ccc atc aat gca cta agc aac tcg ttg cta cgt cac cac aat ttg 3648
Leu Pro Ile Asn Ala Leu Ser Asn Ser Leu Leu Arg His His Asn Leu
1205 1210 1215

gtg tat tcc acc acc tca cgc agt gct tgc caa agg cag aag aaa gtc 3696
Val Tyr Ser Thr Thr Ser Arg Ser Ala Cys Gln Arg Gln Lys Lys Val
1220 1225 1230

aca ttt gac aga ctg caa gtt ctg gac agc cat tac cag gac gta ctc 3744
Thr Phe Asp Arg Leu Gln Val Leu Asp Ser His Tyr Gln Asp Val Leu
1235 1240 1245

aag gag gtt aaa gca gcg gcg tca aaa gtg aag gct aac ttg cta tcc 3792
Lys Glu Val Lys Ala Ala Ala Ser Lys Val Lys Ala Asn Leu Leu Ser
1250 1255 1260

gta gag gaa gct tgc agc ctg acg ccc cca cac tca gcc aaa tcc aag 3840
Val Glu Glu Ala Cys Ser Leu Thr Pro Pro His Ser Ala Lys Ser Lys
1265 1270 1275 1280

ttt ggt tat ggg gca aaa gac gtc cgt tgc cat gcc aga aag gcc gta 3888
Phe Gly Tyr Gly Ala Lys Asp Val Arg Cys His Ala Arg Lys Ala Val
1285 1290 1295

acc cac atc aac tcc gtg tgg aaa gac ctt ctg gaa gac aat gta aca 3936
Thr His Ile Asn Ser Val Trp Lys Asp Leu Leu Glu Asp Asn Val Thr
1300 1305 1310

cca ata gac act acc atc atg gct aag aac gag gtt ttc tgc gtt cag 3984
Pro Ile Asp Thr Thr Ile Met Ala Lys Asn Glu Val Phe Cys Val Gln
1315 1320 1325

cct gag aag ggg ggt cgt aag cca gct cgt ctc atc gtg ttc ccc gat 4032
Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg Leu Ile Val Phe Pro Asp
1330 1335 1340

ctg ggc gtg cgc gtg tgc gaa aag atg gct ttg tac gac gtg gtt aca 4080
Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val Thr
1345 1350 1355 1360

aag ctc ccc ttg gcc gtg atg gga agc tcc tac gga ttc caa tac tca 4128
Lys Leu Pro Leu Ala Val Met Gly Ser Ser Tyr Gly Phe Gln Tyr Ser
1365 1370 1375

cca gga cag cgg gtt gaa ttc ctc gtg caa gcg tgg aag tcc aag aaa 4176
Pro Gly Gln Arg Val Glu Phe Leu Val Gln Ala Trp Lys Ser Lys Lys
1380 1385 1390

acc cca atg ggg ttc tcg tat gat acc cgc tgc ttt gac tcc aca gtc 4224
Thr Pro Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val
1395 1400 1405

act gag agc gac atc cgt acg gag gag gca atc tac caa tgt tgt gac 4272
Thr Glu Ser Asp Ile Arg Thr Glu Glu Ala Ile Tyr Gln Cys Cys Asp
1410 1415 1420

ctc gac ccc caa gcc cgc gtg gcc atc aag tcc ctc acc gag agg ctt 4320
Leu Asp Pro Gln Ala Arg Val Ala Ile Lys Ser Leu Thr Glu Arg Leu
1425 1430 1435 1440

tat gtt ggg ggc cct ctt acc aat tca agg ggg gag aac tgc ggc tat 4368
Tyr Val Gly Gly Pro Leu Thr Asn Ser Arg Gly Glu Asn Cys Gly Tyr
1445 1450 1455

cgc agg tgc cgc gcg agc ggc gta ctg aca act agc tgt ggt aac acc 4416
Arg Arg Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Cys Gly Asn Thr
1460 1465 1470

ctc act tgc tac atc aag gcc cgg gca gcc tgt cga gcc gca ggg ctc 4464
Leu Thr Cys Tyr Ile Lys Ala Arg Ala Ala Cys Arg Ala Ala Gly Leu
1475 1480 1485

cag gac tgc acc atg ctc gtg tgt ggc gac gac tta gtc gtt atc tgt 4512
Gln Asp Cys Thr Met Leu Val Cys Gly Asp Asp Leu Val Val Ile Cys
1490 1495 1500

gaa agc gcg ggg gtc cag gag gac gcg gcg agc ctg aga gcc ttc acg 4560
Glu Ser Ala Gly Val Gln Glu Asp Ala Ala Ser Leu Arg Ala Phe Thr
1505 1510 1515 1520

gag gct atg acc agg tac tcc gcc ccc cct ggg gac ccc cca caa cca 4608
Glu Ala Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gln Pro
1525 1530 1535

gaa tac gac ttg gag ctc ata aca tca tgc tcc tcc aac gtg tca gtc 4656
Glu Tyr Asp Leu Glu Leu Ile Thr Ser Cys Ser Ser Asn Val Ser Val
1540 1545 1550

gcc cac gac ggc gct gga aag agg gtc tac tac ctc acc cgt gac cct 4704
Ala His Asp Gly Ala Gly Lys Arg Val Tyr Tyr Leu Thr Arg Asp Pro
1555 1560 1565

aca acc ccc ctc gcg aga gct gcg tgg gag aca gca aga cac act cca 4752
Thr Thr Pro Leu Ala Arg Ala Ala Trp Glu Thr Ala Arg His Thr Pro
1570 1575 1580

gtc aat tcc tgg cta ggc aac ata atc atg ttt gcc ccc aca ctg tgg 4800
Val Asn Ser Trp Leu Gly Asn Ile Ile Met Phe Ala Pro Thr Leu Trp
1585 1590 1595 1600

gcg agg atg ata ctg atg acc cat ttc ttt agc gtc ctt ata gcc agg 4848
Ala Arg Met Ile Leu Met Thr His Phe Phe Ser Val Leu Ile Ala Arg
1605 1610 1615

gac cag ctt gaa cag gcc ctc gat tgc gag atc tac ggg gcc tgc tac 4896
Asp Gln Leu Glu Gln Ala Leu Asp Cys Glu Ile Tyr Gly Ala Cys Tyr
1620 1625 1630



tcc ata gaa cca ctg gat cta cct cca atc att caa aga ctc cat ggc 4944
Ser Ile Glu Pro Leu Asp Leu Pro Pro Ile Ile Gln Arg Leu His Gly
1635 1640 1645

ctc agc gca ttt tca ctc cac agt tac tct cca ggt gaa atc aat agg 4992
Leu Ser Ala Phe Ser Leu His Ser Tyr Ser Pro Gly Glu Ile Asn Arg
1650 1655 1660

gtg gcc gca tgc ctc aga aaa ctt ggg gta ccg ccc ttg cga gct tgg 5040
Val Ala Ala Cys Leu Arg Lys Leu Gly Val Pro Pro Leu Arg Ala Trp
1665 1670 1675 1680

aga cac cgg gcc cgg agc gtc cgc gct agg ctt ctg gcc aga gga ggc 5088
Arg His Arg Ala Arg Ser Val Arg Ala Arg Leu Leu Ala Arg Gly Gly
1685 1690 1695



agg gct gcc ata tgt ggc aag tac ctc ttc aac tgg gca gta aga aca 5136
Arg Ala Ala Ile Cys Gly Lys Tyr Leu Phe Asn Trp Ala Val Arg Thr
1700 1705 1710

aag ctc aaa ctc act cca ata gcg gcc gct ggc cag ctg gac ttg tcc 5184
Lys Leu Lys Leu Thr Pro Ile Ala Ala Ala Gly Gln Leu Asp Leu Ser
1715 1720 1725

ggc tgg ttc acg gct ggc tac agc ggg gga gac att tat cac agc gtg 5232
Gly Trp Phe Thr Ala Gly Tyr Ser Gly Gly Asp Ile Tyr His Ser Val
1730 1735 1740

tct cat gcc cgg ccc cgc tgg atc tgg ttt tgc cta ctc ctg ctt gct 5280
Ser His Ala Arg Pro Arg Trp Ile Trp Phe Cys Leu Leu Leu Leu Ala
1745 1750 1755 1760

gca ggg gta ggc atc tac ctc ctc ccc aac cga atg agc acg aat cct 5328
Ala Gly Val Gly Ile Tyr Leu Leu Pro Asn Arg Met Ser Thr Asn Pro
1765 1770 1775

aaa cct caa aga aag acc aaa cgt aac acc aac cgg cgg ccg cag gac 5376
Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn Arg Arg Pro Gln Asp
1780 1785 1790

gtc aag ttc ccg ggt ggc ggt cag atc gtt ggt gga gtt tac ttg ttg 5424
Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly Gly Val Tyr Leu Leu
1795 1800 1805

ccg cgc agg ggc cct aga ttg ggt gtg cgc gcg acg aga aag act tcc 5472
Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala Thr Arg Lys Thr Ser
1810 1815 1820

gag cgg tcg caa cct cga ggt aga cgt cag cct atc ccc aag gct cgt 5520
Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro Ile Pro Lys Ala Arg
1825 1830 1835 1840

cgg ccc gag ggc agg acc tgg gct cag ccc ggg tac cct tgg ccc ctc 5568
Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly Tyr Pro Trp Pro Leu
1845 1850 1855

tat ggc aat gag ggc tgc ggg tgg gcg gga tgg ctc ctg tct ccc cgt 5616
Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp Leu Leu Ser Pro Arg
1860 1865 1870

ggc tct cgg cct agc tgg ggc ccc aca gac ccc cgg cgt agg tcg cgc 5664
Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro Arg Arg Arg Ser Arg
1875 1880 1885

aat ttg ggt aag 5676
Asn Leu Gly Lys
1890



<210> 8 
<211> 1892 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: representative 
NS345Core fusion protein

<400> 8 

Met Ala Ala Tyr Ala Ala Gln Gly Tyr Lys Val Leu Val Leu Asn Pro
1 5 10 15

Ser Val Ala Ala Thr Leu Gly Phe Gly Ala Tyr Met Ser Lys Ala His
20 25 30

Gly Ile Asp Pro Asn Ile Arg Thr Gly Val Arg Thr Ile Thr Thr Gly
35 40 45

Ser Pro Ile Thr Tyr Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly
50 55 60

Cys Ser Gly Gly Ala Tyr Asp Ile Ile Ile Cys Asp Glu Cys His Ser
65 70 75 80

Thr Asp Ala Thr Ser Ile Leu Gly Ile Gly Thr Val Leu Asp Gln Ala
85 90 95

Glu Thr Ala Gly Ala Arg Leu Val Val Leu Ala Thr Ala Thr Pro Pro
100 105 110

Gly Ser Val Thr Val Pro His Pro Asn Ile Glu Glu Val Ala Leu Ser
115 120 125

Thr Thr Gly Glu Ile Pro Phe Tyr Gly Lys Ala Ile Pro Leu Glu Val
130 135 140

Ile Lys Gly Gly Arg His Leu Ile Phe Cys His Ser Lys Lys Lys Cys
145 150 155 160

Asp Glu Leu Ala Ala Lys Leu Val Ala Leu Gly Ile Asn Ala Val Ala
165 170 175

Tyr Tyr Arg Gly Leu Asp Val Ser Val Ile Pro Thr Ser Gly Asp Val
180 185 190

Val Val Val Ala Thr Asp Ala Leu Met Thr Gly Tyr Thr Gly Asp Phe
195 200 205

Asp Ser Val Ile Asp Cys Asn Thr Cys Val Thr Gln Thr Val Asp Phe
210 215 220

Ser Leu Asp Pro Thr Phe Thr Ile Glu Thr Ile Thr Leu Pro Gln Asp
225 230 235 240

Ala Val Ser Arg Thr Gln Arg Arg Gly Arg Thr Gly Arg Gly Lys Pro
245 250 255

Gly Ile Tyr Arg Phe Val Ala Pro Gly Glu Arg Pro Ser Gly Met Phe
260 265 270

Asp Ser Ser Val Leu Cys Glu Cys Tyr Asp Ala Gly Cys Ala Trp Tyr
275 280 285

Glu Leu Thr Pro Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Met Asn
290 295 300

Thr Pro Gly Leu Pro Val Cys Gln Asp His Leu Glu Phe Trp Glu Gly
305 310 315 320

Val Phe Thr Gly Leu Thr His Ile Asp Ala His Phe Leu Ser Gln Thr
325 330 335

Lys Gln Ser Gly Glu Asn Leu Pro Tyr Leu Val Ala Tyr Gln Ala Thr
340 345 350

Val Cys Ala Arg Ala Gln Ala Pro Pro Pro Ser Trp Asp Gln Met Trp
355 360 365

Lys Cys Leu Ile Arg Leu Lys Pro Thr Leu His Gly Pro Thr Pro Leu
370 375 380

Leu Tyr Arg Leu Gly Ala Val Gln Asn Glu Ile Thr Leu Thr His Pro
385 390 395 400

Val Thr Lys Tyr Ile Met Thr Cys Met Ser Ala Asp Leu Glu Val Val
405 410 415

Thr Ser Thr Trp Val Leu Val Gly Gly Val Leu Ala Ala Leu Ala Ala
420 425 430

Tyr Cys Leu Ser Thr Gly Cys Val Val Ile Val Gly Arg Val Val Leu
435 440 445

Ser Gly Lys Pro Ala Ile Ile Pro Asp Arg Glu Val Leu Tyr Arg Glu
450 455 460

Phe Asp Glu Met Glu Glu Cys Ser Gln His Leu Pro Tyr Ile Glu Gln
465 470 475 480

Gly Met Met Leu Ala Glu Gln Phe Lys Gln Lys Ala Leu Gly Leu Leu
485 490 495

Gln Thr Ala Ser Arg Gln Ala Glu Val Ile Ala Pro Ala Val Gln Thr
500 505 510

Asn Trp Gln Lys Leu Glu Thr Phe Trp Ala Lys His Met Trp Asn Phe
515 520 525

Ile Ser Gly Ile Gln Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn
530 535 540

Pro Ala Ile Ala Ser Leu Met Ala Phe Thr Ala Ala Val Thr Ser Pro
545 550 555 560

Leu Thr Thr Ser Gln Thr Leu Leu Phe Asn Ile Leu Gly Gly Trp Val
565 570 575

Ala Ala Gln Leu Ala Ala Pro Gly Ala Ala Thr Ala Phe Val Gly Ala
580 585 590

Gly Leu Ala Gly Ala Ala Ile Gly Ser Val Gly Leu Gly Lys Val Leu
595 600 605

Ile Asp Ile Leu Ala Gly Tyr Gly Ala Gly Val Ala Gly Ala Leu Val
610 615 620

Ala Phe Lys Ile Met Ser Gly Glu Val Pro Ser Thr Glu Asp Leu Val
625 630 635 640

Asn Leu Leu Pro Ala Ile Leu Ser Pro Gly Ala Leu Val Val Gly Val
645 650 655

Val Cys Ala Ala Ile Leu Arg Arg His Val Gly Pro Gly Glu Gly Ala
660 665 670

Val Gln Trp Met Asn Arg Leu Ile Ala Phe Ala Ser Arg Gly Asn His
675 680 685

Val Ser Pro Thr His Tyr Val Pro Glu Ser Asp Ala Ala Ala Arg Val
690 695 700

Thr Ala Ile Leu Ser Ser Leu Thr Val Thr Gln Leu Leu Arg Arg Leu
705 710 715 720

His Gln Trp Ile Ser Ser Glu Cys Thr Thr Pro Cys Ser Gly Ser Trp
725 730 735

Leu Arg Asp Ile Trp Asp Trp Ile Cys Glu Val Leu Ser Asp Phe Lys
740 745 750

Thr Trp Leu Lys Ala Lys Leu Met Pro Gln Leu Pro Gly Ile Pro Phe
755 760 765

Val Ser Cys Gln Arg Gly Tyr Lys Gly Val Trp Arg Gly Asp Gly Ile
770 775 780

Met His Thr Arg Cys His Cys Gly Ala Glu Ile Thr Gly His Val Lys
785 790 795 800

Asn Gly Thr Met Arg Ile Val Gly Pro Arg Thr Cys Arg Asn Met Trp
805 810 815

Ser Gly Thr Phe Pro Ile Asn Ala Tyr Thr Thr Gly Pro Cys Thr Pro
820 825 830

Leu Pro Ala Pro Asn Tyr Thr Phe Ala Leu Trp Arg Val Ser Ala Glu
835 840 845

Glu Tyr Val Glu Ile Arg Gln Val Gly Asp Phe His Tyr Val Thr Gly
850 855 860

Met Thr Thr Asp Asn Leu Lys Cys Pro Cys Gln Val Pro Ser Pro Glu
865 870 875 880

Phe Phe Thr Glu Leu Asp Gly Val Arg Leu His Arg Phe Ala Pro Pro
885 890 895

Cys Lys Pro Leu Leu Arg Glu Glu Val Ser Phe Arg Val Gly Leu His
900 905 910

Glu Tyr Pro Val Gly Ser Gln Leu Pro Cys Glu Pro Glu Pro Asp Val
915 920 925

Ala Val Leu Thr Ser Met Leu Thr Asp Pro Ser His Ile Thr Ala Glu
930 935 940

Ala Ala Gly Arg Arg Leu Ala Arg Gly Ser Pro Pro Ser Val Ala Ser
945 950 955 960

Ser Ser Ala Ser Gln Leu Ser Ala Pro Ser Leu Lys Ala Thr Cys Thr
965 970 975

Ala Asn His Asp Ser Pro Asp Ala Glu Leu Ile Glu Ala Asn Leu Leu
980 985 990

Trp Arg Gln Glu Met Gly Gly Asn Ile Thr Arg Val Glu Ser Glu Asn
995 1000 1005

Lys Val Val Ile Leu Asp Ser Phe Asp Pro Leu Val Ala Glu Glu Asp
1010 1015 1020

Glu Arg Glu Ile Ser Val Pro Ala Glu Ile Leu Arg Lys Ser Arg Arg
1025 1030 1035 1040

Phe Ala Gln Ala Leu Pro Val Trp Ala Arg Pro Asp Tyr Asn Pro Pro
1045 1050 1055

Leu Val Glu Thr Trp Lys Lys Pro Asp Tyr Glu Pro Pro Val Val His
1060 1065 1070

Gly Cys Pro Leu Pro Pro Pro Lys Ser Pro Pro Val Pro Pro Pro Arg
1075 1080 1085

Lys Lys Arg Thr Val Val Leu Thr Glu Ser Thr Leu Ser Thr Ala Leu
1090 1095 1100

Ala Glu Leu Ala Thr Arg Ser Phe Gly Ser Ser Ser Thr Ser Gly Ile
1105 1110 1115 1120

Thr Gly Asp Asn Thr Thr Thr Ser Ser Glu Pro Ala Pro Ser Gly Cys
1125 1130 1135

Pro Pro Asp Ser Asp Ala Glu Ser Tyr Ser Ser Met Pro Pro Leu Glu
1140 1145 1150

Gly Glu Pro Gly Asp Pro Asp Leu Ser Asp Gly Ser Trp Ser Thr Val
1155 1160 1165

Ser Ser Glu Ala Asn Ala Glu Asp Val Val Cys Cys Ser Met Ser Tyr
1170 1175 1180

Ser Trp Thr Gly Ala Leu Val Thr Pro Cys Ala Ala Glu Glu Gln Lys
1185 1190 1195 1200

Leu Pro Ile Asn Ala Leu Ser Asn Ser Leu Leu Arg His His Asn Leu
1205 1210 1215

Val Tyr Ser Thr Thr Ser Arg Ser Ala Cys Gln Arg Gln Lys Lys Val
1220 1225 1230

Thr Phe Asp Arg Leu Gln Val Leu Asp Ser His Tyr Gln Asp Val Leu
1235 1240 1245

Lys Glu Val Lys Ala Ala Ala Ser Lys Val Lys Ala Asn Leu Leu Ser
1250 1255 1260

Val Glu Glu Ala Cys Ser Leu Thr Pro Pro His Ser Ala Lys Ser Lys
1265 1270 1275 1280

Phe Gly Tyr Gly Ala Lys Asp Val Arg Cys His Ala Arg Lys Ala Val
1285 1290 1295

Thr His Ile Asn Ser Val Trp Lys Asp Leu Leu Glu Asp Asn Val Thr
1300 1305 1310

Pro Ile Asp Thr Thr Ile Met Ala Lys Asn Glu Val Phe Cys Val Gln
1315 1320 1325

Pro Glu Lys Gly Gly Arg Lys Pro Ala Arg Leu Ile Val Phe Pro Asp
1330 1335 1340

Leu Gly Val Arg Val Cys Glu Lys Met Ala Leu Tyr Asp Val Val Thr
1345 1350 1355 1360

Lys Leu Pro Leu Ala Val Met Gly Ser Ser Tyr Gly Phe Gln Tyr Ser
1365 1370 1375

Pro Gly Gln Arg Val Glu Phe Leu Val Gln Ala Trp Lys Ser Lys Lys
1380 1385 1390

Thr Pro Met Gly Phe Ser Tyr Asp Thr Arg Cys Phe Asp Ser Thr Val
1395 1400 1405

Thr Glu Ser Asp Ile Arg Thr Glu Glu Ala Ile Tyr Gln Cys Cys Asp
1410 1415 1420

Leu Asp Pro Gln Ala Arg Val Ala Ile Lys Ser Leu Thr Glu Arg Leu
1425 1430 1435 1440

Tyr Val Gly Gly Pro Leu Thr Asn Ser Arg Gly Glu Asn Cys Gly Tyr
1445 1450 1455

Arg Arg Cys Arg Ala Ser Gly Val Leu Thr Thr Ser Cys Gly Asn Thr
1460 1465 1470

Leu Thr Cys Tyr Ile Lys Ala Arg Ala Ala Cys Arg Ala Ala Gly Leu
1475 1480 1485

Gln Asp Cys Thr Met Leu Val Cys Gly Asp Asp Leu Val Val Ile Cys
1490 1495 1500

Glu Ser Ala Gly Val Gln Glu Asp Ala Ala Ser Leu Arg Ala Phe Thr
1505 1510 1515 1520

Glu Ala Met Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gln Pro
1525 1530 1535

Glu Tyr Asp Leu Glu Leu Ile Thr Ser Cys Ser Ser Asn Val Ser Val
1540 1545 1550

Ala His Asp Gly Ala Gly Lys Arg Val Tyr Tyr Leu Thr Arg Asp Pro
1555 1560 1565

Thr Thr Pro Leu Ala Arg Ala Ala Trp Glu Thr Ala Arg His Thr Pro
1570 1575 1580

Val Asn Ser Trp Leu Gly Asn Ile Ile Met Phe Ala Pro Thr Leu Trp
1585 1590 1595 1600

Ala Arg Met Ile Leu Met Thr His Phe Phe Ser Val Leu Ile Ala Arg
1605 1610 1615

Asp Gln Leu Glu Gln Ala Leu Asp Cys Glu Ile Tyr Gly Ala Cys Tyr
1620 1625 1630

Ser Ile Glu Pro Leu Asp Leu Pro Pro Ile Ile Gln Arg Leu His Gly
1635 1640 1645

Leu Ser Ala Phe Ser Leu His Ser Tyr Ser Pro Gly Glu Ile Asn Arg
1650 1655 1660

Val Ala Ala Cys Leu Arg Lys Leu Gly Val Pro Pro Leu Arg Ala Trp
1665 1670 1675 1680

Arg His Arg Ala Arg Ser Val Arg Ala Arg Leu Leu Ala Arg Gly Gly
1685 1690 1695

Arg Ala Ala Ile Cys Gly Lys Tyr Leu Phe Asn Trp Ala Val Arg Thr
1700 1705 1710

Lys Leu Lys Leu Thr Pro Ile Ala Ala Ala Gly Gln Leu Asp Leu Ser
1715 1720 1725

Gly Trp Phe Thr Ala Gly Tyr Ser Gly Gly Asp Ile Tyr His Ser Val
1730 1735 1740

Ser His Ala Arg Pro Arg Trp Ile Trp Phe Cys Leu Leu Leu Leu Ala
1745 1750 1755 1760

Ala Gly Val Gly Ile Tyr Leu Leu Pro Asn Arg Met Ser Thr Asn Pro
1765 1770 1775

Lys Pro Gln Arg Lys Thr Lys Arg Asn Thr Asn Arg Arg Pro Gln Asp
1780 1785 1790

Val Lys Phe Pro Gly Gly Gly Gln Ile Val Gly Gly Val Tyr Leu Leu
1795 1800 1805

Pro Arg Arg Gly Pro Arg Leu Gly Val Arg Ala Thr Arg Lys Thr Ser
1810 1815 1820

Glu Arg Ser Gln Pro Arg Gly Arg Arg Gln Pro Ile Pro Lys Ala Arg
1825 1830 1835 1840

Arg Pro Glu Gly Arg Thr Trp Ala Gln Pro Gly Tyr Pro Trp Pro Leu
1845 1850 1855

Tyr Gly Asn Glu Gly Cys Gly Trp Ala Gly Trp Leu Leu Ser Pro Arg
1860 1865 1870

Gly Ser Arg Pro Ser Trp Gly Pro Thr Asp Pro Arg Arg Arg Ser Arg
1875 1880 1885

Asn Leu Gly Lys
1890



<210> 9 
<211> 546 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: representative 
native NS3 protease domain

<220>

<221> CDS

<222> (1)..(546)

<400> 9 

atg gcg ccc atc acg gcg tac gcc cag cag aca agg ggc ctc cta ggg 48
Met Ala Pro Ile Thr Ala Tyr Ala Gln Gln Thr Arg Gly Leu Leu Gly
1 5 10 15

tgc ata atc acc agc cta act ggc cgg gac aaa aac caa gtg gag ggt 96
Cys Ile Ile Thr Ser Leu Thr Gly Arg Asp Lys Asn Gln Val Glu Gly
20 25 30

gag gtc cag att gtg tca act gct gcc caa acc ttc ctg gca acg tgc 144
Glu Val Gln Ile Val Ser Thr Ala Ala Gln Thr Phe Leu Ala Thr Cys
35 40 45

atc aat ggg gtg tgc tgg act gtc tac cac ggg gcc gga acg agg acc 192
Ile Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Thr Arg Thr
50 55 60

atc gcg tca ccc aag ggt cct gtc atc cag atg tat acc aat gta gac 240
Ile Ala Ser Pro Lys Gly Pro Val Ile Gln Met Tyr Thr Asn Val Asp
65 70 75 80

caa gac ctt gtg ggc tgg ccc gct ccg caa ggt agc cga tca ttg aca 288
Gln Asp Leu Val Gly Trp Pro Ala Pro Gln Gly Ser Arg Ser Leu Thr
85 90 95

ccc tgc act tgc ggc tcc tcg gac ctt tac ctg gtc acg agg cac gcc 336
Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr Arg His Ala
100 105 110

gat gtc att ccc gtg cgc cgg cgg ggt gat agc agg ggc agc ctg ctg 384
Asp Val Ile Pro Val Arg Arg Arg Gly Asp Ser Arg Gly Ser Leu Leu
115 120 125

tcg ccc cgg ccc att tcc tac ttg aaa ggc tcc tcg ggg ggt ccg ctg 432
Ser Pro Arg Pro Ile Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro Leu
130 135 140

ttg tgc ccc gcg ggg cac gcc gtg ggc ata ttt agg gcc gcg gtg tgc 480
Leu Cys Pro Ala Gly His Ala Val Gly Ile Phe Arg Ala Ala Val Cys
145 150 155 160

acc cgt gga gtg gct aag gcg gtg gac ttt atc cct gtg gag aac cta 528
Thr Arg Gly Val Ala Lys Ala Val Asp Phe Ile Pro Val Glu Asn Leu
165 170 175

gag aca acc atg agg tcc 546
Glu Thr Thr Met Arg Ser
180



<210> 10
<211> 182
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: representative 
native NS3 protease domain 

<400> 10 

Met Ala Pro Ile Thr Ala Tyr Ala Gln Gln Thr Arg Gly Leu Leu Gly
1 5 10 15

Cys Ile Ile Thr Ser Leu Thr Gly Arg Asp Lys Asn Gln Val Glu Gly
20 25 30

Glu Val Gln Ile Val Ser Thr Ala Ala Gln Thr Phe Leu Ala Thr Cys
35 40 45

Ile Asn Gly Val Cys Trp Thr Val Tyr His Gly Ala Gly Thr Arg Thr
50 55 60

Ile Ala Ser Pro Lys Gly Pro Val Ile Gln Met Tyr Thr Asn Val Asp
65 70 75 80

Gln Asp Leu Val Gly Trp Pro Ala Pro Gln Gly Ser Arg Ser Leu Thr
85 90 95

Pro Cys Thr Cys Gly Ser Ser Asp Leu Tyr Leu Val Thr Arg His Ala
100 105 110

Asp Val Ile Pro Val Arg Arg Arg Gly Asp Ser Arg Gly Ser Leu Leu
115 120 125

Ser Pro Arg Pro Ile Ser Tyr Leu Lys Gly Ser Ser Gly Gly Pro Leu
130 135 140

Leu Cys Pro Ala Gly His Ala Val Gly Ile Phe Arg Ala Ala Val Cys
145 150 155 160

Thr Arg Gly Val Ala Lys Ala Val Asp Phe Ile Pro Val Glu Asn Leu
165 170 175

Glu Thr Thr Met Arg Ser
180